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THE LIFETIMES OF POSITRONS IN METALS! 
G. JONES? AND J. B. WARREN 


ABSTRACT 


The lifetimes of positrons annihilating in some 21 metals have been measured 
comparatively with the lifetime in aluminum. Significant variations have been 
found, as large as 30% of the absolute lifetime in aluminum, and the lifetime 
shows a more marked dependence on the atomic density than on the valence 
electron density of the metal. By comparison of lifetime and angular distribution 
data a mechanism of the annihilation process is proposed. In metals with large 
lattice spacings the annihilation is primarily with valence electrons, but slightly 
enhanced by pick-off annihilations with outer electrons of the ions; as the spacing 
decreases the pick-off probability increases, resulting in larger high momentum 
tails to the parabolic angular distributions and shorter lifetimes. The annihila- 
tion rate increases with valence electron density but levels off at higher densities 
at a rate similar to the theoretical value for positronium ions. With very reduced 
lattice spacing the annihilation with ionic electrons becomes comparable with 
that with valence electrons, so that bell-shaped angular distributions and shorter 
lifetimes result. 


1. INTRODUCTION 


Although the lifetimes of positrons in various metals have been the subject 
of several investigations (De Benedetti and Richings 1952; Bell and Graham 
1953; Ferguson and Lewis 1953; Bell and J¢rgensen 1960; Bisi et a/. 1960), 
there still exist some discrepancies between the experimental results and the 
theoretical interpretations. Following the original suggestions of De Benedetti 
et al. (1950) and Bell and Graham (1953), most of the theoretical descriptions 
of positron annihilation in metals have considered the lifetime to be a function 
of the valence electron density due to Dirac two quantum annihilation of the 
positrons with these free electrons. The measurements, however, indicated a 
constant lifetime of about 2.0 X 10-" sec, independent of the metal. As a result 
of these measurements, an alternative mechanism for annihilation was pro- 
posed by Bell and Graham (1953). They suggested that positron annihilation 
occurred from a bound positron-electron state with its own characteristic 
lifetime, but the observed lifetime appeared to be too short (Garwin 1953). 
Experimental measurements of the angular correlation of the annihilation 
gamma rays (Stewart 1957; Lang and De Benedetti 1957), however, strongly 
suggested the interpretation in terms of the direct annihilation picture. 
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Subsequent theoretical work by Ferrell (1956) then showed how the theory 
of direct positron annihilation with free electrons could be modified by con- 
sidering the effect of screened Coulomb interactions between the positron and 
the Fermi electrons. In this way the expected lifetime variation from metal 
to metal was drastically reduced and lay within the experimental errors of 
the early measurements (De Benedetti and Richings 1952; Bell and Graham 
1953). 

The recent papers of Jones (1959), Bell and J¢rgensen (1960), and Bisi et 
al. (1960) have reported the measurement of significant lifetime variations. 
Bell and J grgensen (1960) measured the absolute positron lifetime in aluminum 
and the alkali metals, the results indicating an increased lifetime in the alkali 
metals; for example, where a lifetime of about 2X10—"sec was found to 
characterize the decay in aluminum, 5X10~" sec was observed for cesium. On 
the other hand, Bisi et al. (1960) performed comparative lifetime measure- 
ments, and detected significant but small variations among a number of the 
chemically stable metals. 

The measurements reported here are a more detailed discussion of our 
results reported earlier (Jones 1959a). 


2. METHOD 

Using a time sorter of high stability (Jones 1960), the lifetimes of positrons 
in a number of chemically stable metals were compared with the lifetime in 
aluminum. The technique used was that described by De Benedetti and Rich- 
ings (1952), whereby one positron source is used in a comparison of the life- 
times in various metal absorbers. There are several distinct advantages to 
such a technique. 

(i) The positron source (Na in this case) need not be moved when changing 
the absorbers. Thus no timing errors due to variations in the gamma-ray flight 
distances are involved. 

(ii) Similarly, the use of the same source means that the results are free 
of any count-rate effects, a factor of major concern when sources of various 
strengths are employed. 

(iii) Again, the use of the one source frees the measurements of the energy- 
dependent instrumental time shifts (Post and Schiff 1950; Bell and Graham 
1953) which plague measurements involving a comparison of gamma-ray 
sources which have different energy spectra. Largely as a result of these 
advantages, the measurements of the positron lifetimes were obtained with a 
relative accuracy of 0.1 X10-"° sec, a value of about 5% of the absolute lifetime 
in aluminum. 


3. EXPERIMENTAL PROCEDURE 
The experimental procedure involved in these measurements is described 


in detail by Jones (1959d). 
The ranges of the positrons in the various metals were obtained for a limited 
number of metals from the measurements of Seliger (1955), and estimated for 
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the rest from the calculations of Nelms (1956). Metal foils significantly thicker 
than these ranges were then employed as absorbers. For many of the metals, 
typical foil thicknesses lay between .010 and .020 inch. 

A series of coincidence resolution curves of about 1} hours’ duration were 
then obtained in alternate fashion for each pair of positron absorbers (one of 
the pair being the aluminum reference). The metal foils for the absorbers 
were mostly of 99.9% purity. 

4. RESULTS 

The difference in the mean lifetime of the positrons annihilating in a parti- 
cular metal compared with that in aluminum was obtained by determining 
the difference between the means of the first moments of each set of coincidence 
resolution curves. These moments were obtained from a numerical analysis 
using the U.B.C. Alwac IIIE digital computer. An example of the results so 
obtained are shown for the metals silver, nickel, germanium, and tungsten 
in Fig. 1. The statistical scatter shown for these values results in a standard 
deviation for the first moments of each metal of about 0.1X10-" sec. The 
centroids (first moments) shown are those calculated from the “‘raw”’ experi- 
mental data. It was unnecessary to subtract the very small random coincidence 
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Fic. 1. Comparative lifetime measurements for nickel, germanium, silver, and tungsten. 
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rates because, for measurements of equal time duration, the background con- 
tributions to the first moments would be the same for both absorbers, and so 


would cancel out when taking differences. 
The results for all the metals investigated are given in Table I. The corre- 


TABLE | 


Comparative lifetime values 


Lifetime difference: (741—Tmetal) X 107!" sec 








Atomic — ———— —_——- 
Metal number Bisi et al. (1960) Jones and Warren 

Titanium 22 + .04+ .08 
Vanadium 23 + .24+ .05 
Iron 26 + .573+ .035 + .33+4 .07 
Cobalt 27 +.31+4.06 
Nickel 28 + .607+ .055 +.29+ .05 
Copper 29 + .283+ .055 +.07+.13 
Zinc 30 +.214+ .062 + .30+ .20 
Germanium 32 — .31+.05 
Zirconium 40 + .25+ .22 
Molybdenum 42 + .704+ .055 +.44+ .08 
Palladium 46 + .200+ .062 +.07+.10 
Silver 47 +.179+ .035 +.10+.10 
Cadmium 48 +.173+ .048 + .04+.06 
Indium 49 — .131+.090 + .08+.10 
Tin 50 +.124+ .048 —.10+.08 
Gadolinium 64 — .38+.10 
Tantalum 73 + .400+ .048 + .38+4.14 
Tungsten 74 + .373+ .035 +.39+ .10 
Gold 79 + .311+.055 — .08+ .05 
Lead 82 — .097+ .048 +0 +.13 

83 — .297+ .027 — .08+.10 


Bismuth 


| 


sponding values from the letter of Bisi e¢ al. (1960) are also given for the 
sake of comparison. The numerical values are given in the form: 


Ar+ (troar/Vn) ’ 


where Ar is the mean value for the increase in the mean lifetime of positrons 
in aluminum over that in the sample metal, 

troa,/+/n is the estimate of the standard error of the mean Az, with oa, 
the standard deviation of the distribution of the estimates of Ar, i.e. 
Ox, = o1+03, oj and o> being the variances of the two distributions character- 
izing the two metals, 

n is the number of resolution curves obtained for each metal, and ¢, is 
determined from Student’s ¢ distribution. 

The errors quoted for the results tabulated in Table I are purely statistical 
in nature. The following possible sources of systematic errors were also con- 
sidered: 

(i) Errors in time calibration. An error in the time calibration of +2% 
would introduce an error of +.01X10- sec in a measurement of a lifetime 
difference of 0.5X10- sec, an error at least five times smaller than the 


statistical ones and so neglected. 
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(ii) Variations in the positron flight time from the source to the absorber. 
Since not all the absorber surfaces were perfectly planar, slight variations in 
the flight time of the positrons between the source and the absorber due to 
the small variations in the distance travelled might be expected to introduce 
some error in the results. To determine the magnitude of such time variations, 
a series of measurements were performed comparing the resolution curves 
resulting from the use of two aluminum absorbers identical in all physical 
characteristics except that one possessed a .016-in. spacer around its periphery 
to maintain it at a distance from the source. The results of these measure- 
ments indicated an increased mean lifetime for the ‘‘distant’’ absorber of 
(.05 X .06) X 10-"° sec. For the magnitude of the variations found in the actual 
source—absorber distances due to surface irregularities, then, this source of 
error was insignificant compared with the statistical one. 


5. MISCELLANEOUS MEASUREMENTS 


As a check of the internal consistency of the results, several additional 
comparative measurements were performed. 

(i) Since both the iron and nickel.samples yielded lower lifetimes for the 
absorbed positrons than did the aluminum, a comparative measurement 
between these two metals was made. The mean positron lifetime in iron was 
found to be greater than that in nickel by (.08+.08) X10- sec. This value 
is in agreement with those of Table I which indicate that the same lifetime 
characterizes both metals, viz. a value (0.30+0.10) X10-" sec less than that 
in aluminum. 

(ii) Since both the iron and cobalt samples were formed from sintered 
powders, rolled to the desired thicknesses, a test to check the effect of sample 
structure was performed. In this case, the mean lifetime of positrons in the 
sintered powdered iron absorber was compared with that in a sample of 
welding steel. The steel tended to exhibit a somewhat shorter lifetime than 
the iron sample (0.12+.08)X10- sec, but, within experimental error, no 
significant dependence of the positron lifetime could be attributed to this 
type of structural variation. 


6. DISCUSSION 

Measurements of the relative lifetimes of positrons in a number of metals 
(about 21 in all) as compared with that in aluminum indicated the existence 
of statistically significant variations of the lifetimes from metal to metal, 
these variations of up to 0.7X10~-" sec amounting to about 30% of the life- 
time in aluminum. The comparative measurements described here are in fair 
agreement with those of Bisi et a/. (1960), the magnitudes of the variations 
agreeing in nine cases within 1} experimental errors. The results for Fe, Ni, 
Mo, and Au are in disagreement. 

The detected variations in lifetime are within the experimental limits of 
about 0.5X10-' sec determined by previous workers (Ferguson and Lewis 
1953; De Benedetti and Richings 1952; and Bell and Graham 1953). However, 
the present measurements fail to substantiate the tendency noted by Bell 
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and Graham (1953) for shorter mean lifetimes to occur for those metals of 
higher atomic number. 

As noted in the Introduction, most theoretical treatments of the annihila- 
tion mechanism in metals predict a dependence of the positron lifetime on 
the density of valence electrons. Such a theory has proved very useful in 
interpreting the angular correlations of the annihilation gamma rays from 
metals (Stewart 1957; Lang and De Benedetti 1957). Many metals, in fact, 
yield an angular correlation distribution which is essentially an inverted 
parabola, with a low intensity tail ranging to larger angles. Such a distribution 
is easily described in terms of direct annihilation between thermalized positrons 
and the electrons of a Fermi gas. 

Lang and De Benedetti (1957) separated their metals into three categories, 
depending on the relative intensities of the parabolic distributions compared 
with the larger angle tails. Those metals exhibiting a strong central parabola 
with very small tails were classified as group A; those with larger tails but 
which still retain well-defined central parabolas were termed group B; and 
those with tails so large that any separation into two distinct components 
became very artificial were designated group C. Thus, when investigating the 
lifetime dependence on valence electron density for our results, only those 
metals of groups A and B were considered. These two groups are the ones 
for which an interpretation in terms of annihilation with electrons of a Fermi 
gas appears to be most sound. 

Figure 2 is a plot of the observed positron annihilation rate based on our 
value of the absolute lifetime in aluminum of (2.45+0.15)X10- sec as a 
function of the valence electron density. The errors indicated are those of 
the comparative measurements only. The solid line is the ‘‘enhanced” Dirac 
annihilation rate for these metals according to the calculations of Ferrell 
(1956). The results illustrated in this figure fail to indicate any significant 
dependence of the positron annihilation rate on the density of the valence 
electrons. 

The possibility of the positron lifetime depending on other characteristics 
of the metals was also considered. Figure 3, as an example, illustrates the 
dependence of the positron annihilation rates on the mean interatomic spacing, 
i.e. the cube root of the reciprocal of the atomic density, for all the metals 
investigated. It would appear from this figure that a reduced positron life- 
time (hence increased annihilation rate) results for those metals characterized 
by a small interatomic spacing. If the positron annihilation rate is in fact 
dependent on the space available within a metal, a plot of the annihilation 
rate versus the lattice constant of the metal should be more significant, since 
the space available within a crystal lattice is a function both of the mean 
interatomic spacing and the type of crystal structure. Such a dependence is 
illustrated in Fig. 4, where the experimental positron annihilation rates are 
plotted as a function of the lattice constant for those metals of cubic sym- 
metry (body-centered cubic and face-centered cubic). The values for the 
lattice constants used in Fig. 4 were obtained from Seitz (1940). This diagram 
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Fic. 2. Dependence of annihilation rate on valence electron density for those metals which 
give inverted parabolic angular distributions of annihilation radiation in good accord with 
theory based on annihilation with conduction electrons. 

Fic. 3. Dependence of annihilation rate on mean atomic spacing. 


suggests a quite definite dependence of the positron annihilation rate on the 
size of the metallic crystal lattice. 

Also of interest is the apparent dependence of the shape of the angular 
correlation function of the annihilation gamma rays on the size of the crystal- 
line lattice. Those metals of groups A and B which are described by Lang 
and De Benedetti (1957) as being most characteristic of positron annihilation 
with electrons of a Fermi gas appear to be those with large lattice spacings. 
Those metals, on the other hand, which exhibit shorter positron lifetimes and 
bell-shaped angular correlation functions are those metals with smaller lattice 
spacings. 

Several other possible forms of lifetime dependence were also considered 
for these metals. In particular, the lifetime dependence on both the fractional 
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Fic. 4. Dependence of annihilation rate on lattice spacing for metals with body- and 
face-centered cubic structure. The lifetime in aluminum has been taken to be 2.45107" sec. 


ionic volume (defined by the ratio of the total ionic volume in a unit cell to 
the volume of the unit cell itself) and the average density of the outer metal 
ion core electrons (averaged over the whole volume within the metal) were 
examined. Neither yielded as strong a dependence as that of Fig. 4. For the 
metals in this figure, a graph with somewhat less scatter resulted when the 
annihilation rates were plotted as a function of the lattice constant minus 
the metal ion diameters (also obtained from Seitz 1940). The scatter in the 
experimental points tended to be reduced for all the metals concerned except 
the three, copper, silver, and gold, which metals exhibited a more marked 
deviation in fact than shown in Fig. 4. 

The observed increase in both the annihilation rate and the intensity of 
the high momentum component of the angular correlation distributions with 
an increase in the ‘‘packing’’ of the metal atoms in the crystalline lattice 
tends to support the view that the high momentum tail represents an increase 
in the annihilation rate over that characterizing the inverted parabola portion 
of the distribution. This could be produced by a mechanism similar to that 
described by Berko and Plaskett (1958) in terms of ‘‘core annihilations’”’. A 
similar view was expressed by Bell and J¢rgensen (1960) concerning their 
lifetime results for the alkali metals. 

If the above interpretation is assumed, then it is possible to relate the 
lifetime and angular distribution data more closely. Following the notation 
of Lang and De Benedetti (1957), the angular correlation functions of the 
groups A and B metals can be described in terms of the ratio, Ag/ Ap, of the 
areas of the wide Gaussian and parabolic parts of the distribution function. It 
is easy to show that the ratio of the areas is identical with the ratio of the 
areas of the corresponding portions of the momentum distributions plotted 
by Stewart (1957). Hence A,/ A, is equal to the relative probability of positron 
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decay resulting in the Gaussian, ‘‘high-momentum’’ component to that 
resulting in the parabolic distribution. 

Thus, if the total probability for annihilation, P,, can be represented in 
the form P, = P,+P,, where P, is the annihilation rate characteristic of the 
parabolic component, and P, is the annihilation rate for the Gaussian com- 
ponent, then P,/P, = A,/Ap, the ratio of the areas as measured by Lang 
and De Benedetti (1957), and P, = P,(1+A,/A,)~.* This annihilation rate 
has been plotted as a function of the valence electron density in Fig. 5. Values 
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Fic. 5. Dependence of annihilation rate on valence electron density. 

(i) The points marked © represent experimental values based on an assumed lifetime for 
positrons in aluminum of 2.0X10~sec (Bell and J¢rgensen 1960) and include Bell and 
peers (1960) results for the alkali metals together with the comparative lifetimes given 

ere. 

(ii) The full points @ on curve A are the values deduced for P-parabolic, the annihilation 
probability with electrons of the Fermi sea, using the values of Ag/Ap from Lang and De 
Benedetti (1957). 

(iii) Curve B indicates the position these P, points would occupy assuming a lifetime in 
aluminum of 2.45 X 107!" sec. 

(iv) Curve C shows Ferrell’s (1956) enhanced curve. 

(v) Curve D shows the annihilation rate from positronium atoms enhanced by a pick-off 
mechanism such that AP/AN = 1.5X 107" cm$ sec". 


of A,/A, were obtained from the data of Lang and De Benedetti. For the 
metals indium, potassium, and cesium which were not discussed in their 
paper, estimates of 4,/A) were made from the curves of Stewart (1957). In 
Fig. 5, the results of Bell and J¢rgensen (1960) are also plotted. The life- 
times of the dominant component (about 95%) observed by Bell and J¢r- 


*This argument is similar to that outlined independently by Bell and Jérgensen (1960). 
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gensen were the ones incorporated in this figure. For the sake of consistency, 
our comparative lifetime results are shown normalized to the absolute life- 
time obtained for aluminum by Bell and J¢grgensen. On the basis of our 
somewhat larger absolute lifetime determination for aluminum (of 2.45 K 10-" 
sec), the dashed line B results. 

This graph suggests, in fact, that after an initial increase in the positron 
annihilation rate due to the increased valence electron density, a levelling 
off of the annihilation rate results. 

The annihilation rate of the parabolic component at high electron density 
appears to reach a constant value near that calculated for the positronium 
ion which consists of a positron and two electrons (Ferrell 1956). Such a 
model would be compatible with the interpretation of the angular correlation 
results in terms of the momentum distribution of electrons in a Fermi sea 
only if a rapid electron exchange mechanism was involved. Since the ion is 
expected to have a very low binding energy even in free space (Ferrell (1956) 
suggests only 0.2 ev more than the positronium atom), it would not be un- 
reasonable to expect the electrons to be rapidly exchanged with the valence 
electrons of the metal. 


7. CONCLUSIONS 


A series of comparative lifetime measurements for positrons in various 
metals has yielded the result that significant lifetime variations do exist and 
amount to about 30% of the absolute lifetime of positrons in aluminum. 

It is found for the metals concerned that the positron lifetime shows a 
greater dependence on the atomic density of the metal than it does on the 
valence electron density. 

In addition, these results together with those of Bell and J¢rgensen (1960) 
suggest a mechanism for annihilation in terms of the following picture: 

(a) In a metal of large lattice spacing, positron annihilation can be charac- 
terized by annihilation primarily with the valence electrons. This annihilation 
rate is then enhanced by “‘pick-off’’ annihilations with the outer electrons 
of the surrounding metallic ions. Thus, the former process produces the 
parabolic portion of the angular correlation distribution, whereas the latter 
yields the Gaussian. Thus, as the lattice spacing is reduced, the ‘‘pick-off”’ 
annihilation probability increases resulting in a larger Gaussian component 
of the angular correlation distribution, and shorter positron lifetimes, both 
of which are observed experimentally. 

As the valence electron density is increased, the annihilation rate follows 
suit initially, but levels off to a constant value at higher densities. This 
constant value may characterize annihilation from a state similar to that 
of a positronium ion, with the added requirement of rapid electron exchange 
with the valence electrons of the metal in order to retain the characteristic 
parabolic shape of the angular correlation function. 

(6) In metals of reduced lattice spacing, the outer electrons of the metal 
ions are assumed to extend so far into the interstitial regions that the prob- 
ability of positron annihilation with them is comparable with or greater than 
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that with the valence electrons. As a result, a “‘bell-shaped” angular corre- 
lation curve is produced, the associated positron lifetime being shorter than 
that characterizing case (a) above. 


ACKNOWLEDGMENTS 
We are very grateful for the frequent assistance offered by the members 
of the U.B.C. Van de Graaff group in the recording of the data, and for the 
generous facilities offered by the U.B.C. Computing Section for the analysis 
of the data presented here. The willingness of R. Butters of the Metallurgy 
Department in producing the many metal samples to the desired dimensions 


is also much appreciated. 

In addition, one of us (G.J.) wishes to acknowledge the generous assistance 
of the National Research Council of Canada for the award of an N.R.C. 
Studentship for the period of this work. 


REFERENCES 


BELL, R. E. and GrawaM, R. L. 1953. Phys. Rev. 90, 644. 

BELL, R. E. and JORGENSEN, M. H. 1960. Can. J. Phys. 38, 652. 

BeErRKO, S. and PLAsKETT, J. S. 1958. Phys. Rev. 112, 1877. 

Bist, A., Farnt, G., Gatti, E., and Zappa, L. 1960. Phys. Rev. Letters, 5, 59. 

De BENEDETTI, S., Cowan, C. E., KONNEKER, W. R., and Primakorr, H. 1950. Phys. 
Rev. 77, 205. 

De BENEDETTI, S. and RicHINGs, H. J. 1952. Phys. Rev. 85, 377. 

FerGuson, A. T. G. and Lewis, G. M. 1953. Phil. Mag. 44, 1339. 

FERRELL, R. A. 1956. Revs. Modern Phys. 28, 308. 

GarwIN, R. L. 1953. Phys. Rev. 91, 1571. 

Jones, G. 1959a. Bull. Am. Phys. Soc. Ser. II, 4, 367. 

——— 1959b. Ph.D. Thesis, University of British Columbia. 

—— 1960. J. Sci. Instr. 37, 318. 

LanG, G. and De BENEDETTI, S. 1957. Phys. Rev. 108, 914. 

Newtms, A. T. 1956. Natl. Bur. Standards Circ. 577. 

Post, R. F. and Scuirr, L. I. 1950. Phys. Rev. 80, 1113. 

Seitz, F. 1940. The modern theory of solids (McGraw-Hill). 

SELIGER, H. H. 1955. Phys. Rev. 100, 1029. 

Stewart, A. T. 1957. Can. J. Phys. 35, 168. 











THE B"(a,n)N“* AND B(t,2n)C"'§ REACTIONS 
IN A NUCLEAR REACTOR! 


J. C. Roy anp J. J. HAwTon 


ABSTRACT 
Ten-minute N' and 20-minute C" have been produced in the NRX reactor 
by the following reactions: 
B%(a,n)N® and B!(t,27)C". 


The alpha particles and the tritons were generated by thermal neutron capture 
in B!° and Li®, respectively. These reactions offer a simple and rapid method of 
obtaining C" and N* in fairly high specific activities. It is estimated that 
activities of the order of 100 uc of Nand 1 mc of C" can be produced in a thermal 
neutron flux of 6.5 X10" n/cm? sec. Incidental to these studies are some obser- 
vations on the chemical behavior of N83 and C" recoil atoms in a few boron com- 


pounds. 
INTRODUCTION 

The occurrence of a variety of secondary reactions in a nuclear reactor is 
now well established. Many years ago, Knight et a/. (1951) showed that tritons 
from the Li®(m,a) reaction initiate nuclear reactions, and several other studies 
of triton induced reactions, (t,7), (¢,p), and (t,a), have appeared in the literature 
(Almqvist 1950; Iwersen et al. 1953; Banks 1955; Sher and Floyd 1956; Cook 
and Shafer 1954; Stang et a/. 1958). The protons and deuterons arising from 
neutron—hydrogen or neutron—deuterium collisions can also initiate secondary 
reactions in a reactor. The best known example is probably the (d,m) reaction 
of recoil deuterons with O' in a heavy reactor producing curie amounts of 
F'7 (Henderson and Whittier 1957). Further studies of these processes made 
by Glickstein (1959), Glickstein and Winter (1960), and Roy et al. (1960) 
have established that (p,7), (p,y), and (p,@) reactions take place in a reactor, 
and that they can be used to produce tracer amounts of certain light radio- 
nuclides. Moreover, Roy et al. have shown that protons and deuterons 
generated from some (,p) and (n,d) reactions can also induce (p,m), (p,q), 
and (d,n) reactions. Rudenko and Sevast’anov (1959) reported that the 
irradiation of lithium carbonate yields Be’ by a (t,27) reaction. However, the 
work of Roy et al. on the production of Be’ from the neutron irradiation of 
several lithium compounds indicates that the Li7(p,m) rather than the Li®(t,2n) 
reaction is the main process leading to Be’ in natural lithium compounds. In 
these compounds, the protons are generated mostly by a (n,p) reaction on Li®, 

In this paper, evidence will be presented for the occurrence of (¢,2”) reaction, 
namely the B(t,2”)C" reaction, and of another type of secondary reaction 
involving alpha particles, namely the B(a,n)N* reaction. These reactions 
offer a simple and rapid method of obtaining the nuclides C" and N". Incidental 
to these studies are some observations on the chemical behavior of N™ and 
C" atoms recoiling from the nuclear reaction in a few boron compounds. 
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EXPERIMENTAL 

(a) Materials and Irradiations 

Boric oxide, boron powder, potassium metaborate, natural lithium meta- 
borate, and lithium-6 metaborate were used as starting materials. The Li'BO, 
was prepared by fusion of boric acid with lithium-6 carbonate (Brauer 1951). 
A 10- to 500-mg sample was placed in a cylindrical iron or cadmium capsule 
(2.4 cm long by 0.6 cm inside diameter) and irradiated in an empty lattice 
position of the NRX reactor at Chalk River at a thermal neutron flux of 
6.5 X10" n/cm? sec. 


(b) Purification and Separation of N™® 

The separation of N' was based on the distillation of ammonia, and con- 
sequently only the N" that could be obtained as ammonia was recovered. 
This includes N"™ existing either as ammonium ion or as nitrates and lower 
oxides that can be converted to ammonia by treatment with Devarda’s alloy 
(typical analysis of the alloy is 45% Al, 50% Cu, 5% Zn) (Treadwell and Hall 
1942). Henceforth, the N™ converted to ammonia by Devarda’s alloy will 
be referred to as the nitrate fraction. . 

The irradiated target was dumped in a 250-ml flask containing the solvents 
and carriers, and dissolved by heating the solution near boiling. The solution 
was then made strongly basic by addition of NaOH, and the Devarda’s alloy 
added at this stage if NO ;~ carrier was used. The flask was provided with a 
stopper carrying (i) a tube through which air could be introduced during the 
distillation, (ii) an exit tube inserted into a test tube containing an ice-cold 
alcoholic solution of chloroplatinic acid. The basic solution was then heated 
slowly and the ammonia received in the test tube precipitated as ammonium 
chloroplatinate. 


(c) Purification and Separation of C™ 

The separation of C! was based on the distillation of carbon dioxide and 
carbon monoxide, and consequently only the C"™ that could be obtained in 
these two chemical forms was recovered. 

The irradiated target was dumped in a 250-ml flask containing water, 
carbonate carrier, and, in a few experiments, carbon monoxide. The flask was 
provided with a stopper carrying (i) a tube through which air could be intro- 
duced during the distillation, (ii) a dropping funnel through which HCl or 
chromic acid could be introduced, (iii) an exit tube, which provided connection 
for the recovery of the CO, and CO. The CO, released on addition of the acid 
was flushed into the exit tube by heating gently; it was assumed that the CO 
was also flushed out by this technique. For the collection of COs and CO, 
the exit tube was connected to an absorption system formed of three units: 
the first, containing ‘‘anhydrone’’, absorbs water; the second, containing 
‘“fascarite’’ or a solution of barium hydroxide, absorbs the CO»; and the third, 
containing an ammoniacal solution of NH,Cl+CueCle, absorbs the CO 
(Treadwell and Hall 1942). In the first runs on the separation of C", the 
sources were occasionally contaminated by F'!® produced by the O'*(t,m) 
reaction. Since the F!8 activity was carried into the CO, collection unit by a 
small amount of water, it was easily removed by the use of ‘‘anhydrone’’. 
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(d) Counting 

The precipitates of ammonium chloroplatinate or barium carbonate were 
filtered on a “‘millipore’’ filter paper, dried with alcohol, mounted on an 
aluminum tray, and counted. The “‘ascarite’’ was pressed as a cake onto an 
aluminum tray and counted as such. 

Both N* and C" are positron emitters and decay with half-lives of 10 and 
20 minutes respectively. Their radioactivities were measured by counting the 
annihilation radiation with a 3 in. by 3 in. Nal (TI) crystal used in conjunction 
with a 100-channel analyzer. After counting, the chemical yields were measured 
by a determination of the platinum and barium oxide contents of the sources. 
From the counting rates extrapolated to the end of the irradiation, the efficiency 
of the crystal, and the chemical yield, the disintegration rates were determined. 


RESULTS AND DISCUSSION 

(a) Yields 

The y-ray spectra of the sources prepared by the procedures given in the 
previous section exhibit only a photopeak at 0.51 Mev which decays over 
several half-lives with a period of, or close to, 10 or 20 minutes. The values 
of the y-ray energy and the half-lives show that the activities being measured 
are N® and C", respectively. 

The experimental results are summarized in Table I. The different targets 


TABLE I 


Yields of N'3 and C" from the irradiation of boron compounds at a thermal neutron flux of 
6.5 X10'3 n/cm? sec 


Saturation activities of N' and 
C" per 100 mg of target 





Target Carriers Solvents Atoms Disint. /sec 


(A) Production of N'8 


B.O {no cadmium NH,*,NO;— 3N HCl 9.1108 1.1108 
*\ cadmium NH,*t,NO3~ 3N HCl 1.310 1.6X 10? 
Boron powder NH,t 6N HNO; 2.7X10' 3.1310" 
KBO, NH,*,NO;- 3N HCl 8.7X105 1.0X 108 
LiBO» NH,+,NO;- 3N HCl 2.6X 108 3.0X 108 
Li®BO, NH,*t,NO;— 3N HCl 1.5X10' 1.7xX108 
(B) Production of C¥ 
{no cadmium CO;~ HCl 6.4107 3.7 X10 
LiBO2{ no cadmium CO HCl 1.2108 7.4X10? 
{cadmium CO;- HCl 2.1X10° 1.2108 
LiBO. CO; Chromic 7.0X107 4.0104 
acid 
LitBO, CO;- HCl 1.3X10° 7.5X105 





that were irradiated, and the conditions of the irradiations, are given in column 
1. The nature of the carriers and solvents used to recover the N® and C" activi- 
ties are listed in columns 2 and 3. Finally, the saturation activities of N'* and 
C" for 100 milligrams of target irradiated at a thermal neutron flux of 6.5 X10" 
n/cm* sec are given in the last two columns, in atoms and in disintegrations per 
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second respectively. Each entry represents the average of several experiments; 
the reproducibility of the results is estimated to be about +15%. It should 
be emphasized that the yields of N* are valid only for the set of experimental 
conditions given in the table. This is because the yields of N" depend on the 
treatment of target after irradiation (see next section). 


(b) Chemical Behavior of N* and C" Recoil Atoms 

Hudis (1960) has discussed some of the problems associated with the sepa- 
ration of radioactive nitrogen and carbon atoms in his recent monograph on 
the radiochemistry of these elements. The two main problems are those of 
isotopic mixing of carrier and tracer atoms and their complete conversion to 
known oxidation states; these arise because the two elements can exist in 
many different oxidation states, giving rise ultimately to metastable and 
stable species between many of which isotopic exchange reactions are slow 
(Wahl and Bonner 1951). The nature of these species depends strongly on the 
structure of the target, the type and conditions of the irradiation (Harbottle 
and Sutin 1959), and obtaining a detailed knowledge of them is, in most 
cases, a very complex problem. As a consequence, it is always difficult to 
determine the absolute yields of Nand C" in systems where the chemical 
separation of these radionuclides is involved; rather it is usual to assume that 
some of the radioactive nitrogen and carbon atoms would be formed in some 
common oxidation states, leading to compounds which can be easily separated. 
This procedure has been followed in this study. No attempt has been made to 
separate all the possible nitrogen and carbon compounds containing N™ and 
C"; it has simply been assumed that some of the N® and C! will be finally 
incorporated in an ammonium or nitrate ion, while some of the C" will 
eventually form’ CO, and CO. The direct determination of the total amount 
of N® and C" atoms, by measuring their activities in the target prior to their 
separation, is not possible because of the presence of other radioactivities. 

We have made a preliminary investigation of the chemical behavior of N¥ 
atoms produced by the B!°(a,m) reaction from the thermal neutron irradiation 
of several boron compounds and the observations made so far indicate that 
the recovery of N" atoms as NH,4t and NO>°~ and its distribution between the 
two species depend on the following factors: 

(i) The nature of the target. 

(ii) The solvents; striking differences are observed in the distribution of 
N! between the species NH,+ and NO;~ when the solvent is changed from 
a strong base to a strong acid. 

(iii) The temperature at which the target is dissolved seems also to influence 
the recovery of N as NH,*+ and NO>~ ions. 

(iv) The annealing of the irradiated target before dissolution. 

A few experiments were done to determine the distribution of C! atoms 
between CO», CO, and elemental carbon in natural lithium metaborate. First, 
the distribution of C!! atoms between CO, and CO was determined by the 
methods described in the experimental section using HCI as the solvent. The 
data in Table I show that the ratio C"O2/C"O is about 50, indicating that the 
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C" atoms react with the substrate to form COz rather than CO. Secondly, 
the distribution of C™ atoms between CO, and elemental carbon was deter- 
mined by dissolving the target in a hot mixture of chromic-sulphuric acid. 
By this treatment elemental C should be oxidized to CO, (Hillebrand et al. 
1953); any increase in the yield of C" under these conditions would indicate 
that some C'! atoms exist as such in the target. Data in Table I show a slight 
increase, but it is within the limits of reproducibility of the results, estimated 
to be +15%. Hot chromic-sulphuric acid mixture is likely to oxidize to CO» 
other chemical species resulting from the reaction of C' with other atoms 
present in the target. From these observations, it is concluded that C" pro- 
duced by the B"(t,2”) reaction in the thermal neutron irradiation of lithium 
metaborate reacts with the substrate to form mainly CO: or some chemical 
species that can be easily transformed to COz on dissolution. 


(c) Discussion of the Mechanism of the Nuclear Reactions 

The reactions involved in the formation of N' and C" from the neutron 
irradiation of lithium and boron compounds in a reactor can be established 
from an examination of the data assembled in Table I. 

From the yields of Nin boron oxide irradiated with and without cadmium, 
it is clear that N" arises from reactions involving thermal neutrons. Next, the 
high yield of N' in boron powder target shows that neutron irradiation of 
boron isotopes only is sufficient to produce N'. The interaction of thermal 
neutron with B", for which the cross section is 3810 barns, produces 1.8-Mev 
alpha particles. These alpha particles could then react with boron to yield 
N', Since the B!°(a,m) reaction (Q-value = 1.07 Mev) is the only one that can 
yield N with 1.8-Mev alpha particle on the basis of nuclear reaction Q-values 
(Everling et al. 1961), it is concluded that an alpha-induced reaction takes place 
in the reactor. 

The 2.0-Mev alpha particle generated by thermal neutron capture in Li® 
should also initiate the B'(a,n)N™ reaction. This should be shown experi- 
mentally from the irradiation of natural lithium metaborate and lithium-6 
metaborate; on the basis of the isotopic abundance and neutron capture cross 
sections of Li‘ and B" respectively, the yield of N' should increase by a factor 
of 2.1; a further increase is expected because the alphas from Li® are more 
energetic than those coming from B"” by 0.2 Mev. Hence, provided that the 
other factors influencing the yield of N' remain the same, one should expect a 
total increase of a factor of 3 to 4. As seen in Table I, one observed an increase 
by a factor of 6, which can be considered in fair agreement with the predicted 
value. 

The yields of C"™ obtained from the neutron irradiation of natural lithium 
metaborate, with and without cadmium, show that 96% of the activity arises 
from a reaction involving thermal neutrons. This rules out the possibility that 
C" was produced by (p,y) or (p,m) reactions on B" or B", because the genera- 
tion of energetic protons by (,p) reactions or by neutron—hydrogen collisions 
(LiBO: could have contained a small amount of water) requires fast neutrons 
(Roy et al. 1960). The 20-fold increase in the rate of formation of C" with a 
lithium-6 metaborate over the natural compound shows that Li® is involved 
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in the reaction. The interaction of thermal neutrons with Li®, for which the 
cross section is 945 barns, produces alpha particles and tritons with an energy 
of 2.0 and 2.73 Mev respectively. Of all the possible routes leading to C", 
the only one which is consistent with the experimental data, and which can 
take place on the basis of nuclear reaction Q-values (Everling et a/. 1961), isa 
(t,2n) reaction on B"” (Q-value = 0.217 Mev). 


(d) Application of the Reactions to the Production of N® and C® 

We shall review now the best conditions under which N™ and C" can be 
produced by the (a,m) and (¢,2”) reactions respectively on B". 

Suitable boron-containing targets stable under high neutron flux are limited 
almost exclusively to boron oxide, the borates, and boron powder. Because of 
its ease of handling, its stability, and good yields of N'* as NH,* ion, boron 
powder is one of the best targets even though it is readily soluble only in 
nitric acid. The presence of a large amount of nitrate carrier, resulting from 
the dissolution in nitric acid, makes it impractical to recover the N'* as NO;-. 
Because of the large cross section for the B'(n,a)Li’? reaction, the neutron 
flux, and consequently the alpha flux, is depressed inside the sample with the 
result that the rate of formation of N* depends on the thickness, the shape, 
and the size of the sample. The variation of the yield of N versus the amount 
of natural boron powder has been plotted (for the irradiation geometry used 
in these experiments) in Fig. 1. It is seen that a maximum yield is obtained 
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Fic. 1. Yield of N'’ versus the amount of natural boron powder for the irradiation con- 
ditions described in the text. 
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for about 100 mg of sample, which is equivalent to a saturation activity of 
about 1 uc of N'*. This yield could be increased substantially by using enriched 
B” and by spreading the sample in a thin annular shell in a suitably designed 
capsule. By using boron-10 enriched to 93% the yield will increase as the 
square of the enrichment factor, that is 25 times: 93% boron-10 can be 
obtained from the Stable Isotope Division of the Oak Ridge National Labora- 
tory for $4.00 per gram of boron powder. Next, by using thin target, a further 
few-fold increase in the yield could be achieved with the result that amounts 
up to 100 uc of N* could be produced in a nuclear reactor. 

Because of the high neutron absorption cross section of lithium metaborate, 
the yield of C", as in the case of N", will also depend on the thickness, the 
shape, and the size of the sample. It has been estimated that for the irradiation 
geometry used the maximum C" activity will be obtained for a weight of 400 
to 500 mg of lithium-6 metaborate. From this weight and the yield given in 
Table I, it is calculated that a saturation activity of 80-100 uc of C" will be 
produced at a flux of 6X10" n/cm? sec. This yield could be increased several- 
fold if a thin sheet of Li®-boron alloy were irradiated, with the result that 
millicurie amounts of C" could be produced by this reaction. 

It is worth noting that the neutron irradiation of lithium-6 metaborate 
will give rise to three positron emitters, N™“, C", and F!8. These could be 
isolated in a carrier-free state. 
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THE AZIMUTHAL DISTRIBUTION OF ULTRA-HIGH FREQUENCY 
RADAR ECHOES FROM AURORA! 


P. E. KEtty, D. R. HANSEN, AND P. A. ForsyTH 


ABSTRACT 


The distribution in range and azimuth of auroral echoes obtained with an 
ultra-high frequency radar at Prince Albert is examined. Such distributions 
may be controlled (i) by the effect of volume integration in the auroral con- 
figurations, (ii) by the geographical distribution of aurora, and (iii) by aspect 
sensitivity of the aurora] scatterers. With the radar parameters chosen for this 
experiment, (i) can be neglected. Of the remaining two effects, (ii) appears to be 
more important in the period immediately before local midnight and (iii) in the 
period immediately following local midnight. 


INTRODUCTION 


The mechanism by which radio waves are scattered in the lower ionosphere 
during auroral disturbances is still very poorly understood. The size and shape 
of the scattering centers have not been established. Several attempts have 
been made to interpret, in terms of the “aspect sensitivity”’ of the scatterers, 
the distribution in range and azimuth of echoes observed with a specific radar 
(the so-called azimuthal distribution). This approach assumes that the pre- 
ferential occurrence of echoes in one direction (geomagnetic north for stations 
near the northern auroral zone) is due to the elongation of the clouds of auroral 
ionization along the geomagnetic field lines. While aspect sensitivity un- 
doubtedly exists, it was pointed out by Forsyth (1960) that the degree of 
aspect sensitivity (or elongation of the scatterers) could not be accurately 
estimated from existing radar data. Most of these data were obtained using 
very high frequency radars having wide antenna beam-widths and short pulse 
durations. These two characteristics combine to produce a type of observational 
selection that seriously modifies the azimuthal echo distribution. 

Recently, new radars operating in the ultra-high frequency band have come 
into use. These radars are capable of much better spatial resolution than are 
the older v.h.f. radars and so may shed new light on the reflection mechanisms 
involved in radio-auroral reflections. It is already clear that the general 
characteristics of u.h.f. auroral radar echoes are significantly different from 
those observed at v.h.f., nevertheless, an understanding of the u.h.f. reflection 
mechanism is likely to be of value in studies at lower frequencies. 

This paper grew out of a program of measurements started early in 1960, 
using the Prince Albert Radar. The program was designed to examine several 
different aspects of the reflection mechanism but was interrupted in January, 
1961, when much of the equipment and most of the records were destroyed 
by fire. The results of a partial analysis of some of the radar records survived 

1Manuscript received July 27, 1961. 
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the fire and this paper is based on those results. Although the data are much 
fewer than would be needed for a definitive investigation, the results may be 
of some value in studies of auroral ionization. Therefore, it seems appropriate 
to report them at this time. 


RESULTS AND ANALYSIS 
The parameters of the radar equipment for the investigation are given in 
Table I, and the details of the observations are given in Table II. The pulse 


TABLE | 


Radar characteristics during the experiment 


Location of radar Prince Albert, Sask.: 53.2° N., 105.9° W. 








Antenna Paraboloid, 84 ft diameter 

Antenna beam-width Approximately 2.5° 

Antenna scanning mode Azimuth scan, 180° centered at true north 
Elevation scan, stepping at 2° intervals 

Transmitter peak power 600 kw 

Transmitter pulse duration 300 microseconds 

Receiver bandwidth 45 kc/s (effective) 

Receiver noise figure 3 (5 db) 

Frequency 448 Mc/s 

Polarization Circular: left transmitted, right received 
TABLE II 


Details of the observations June 10-11, July 13-25, 1960 


Before midnight After midnight 








Elevation angle No. of No. of No. of No. of 
(degrees) scans echoes scans echoes 

1 220 78 308 294 

3 220 170 308 667 

5 220 239 308 834 

7 220 106 308 782 

9 220 97 308 541 

11 220 10 308 376 





duration used (300 microseconds) corresponded to a range increment of about 
45 km. At a range of 800 km the width of the radar beam was about 40 km. 
In consequence, the volume of space that could simultaneously contribute to 
a radar echo might be approximated by a cylinder having a length of 45 km 
and diameter varying with range about a value of 40 km. In fact, the sensitivity 
of the radar is not constant throughout this volume, but is a maximum for a 
line scatterer located along the cylinder axis and decreases smoothly to zero 
as the line scatterer is reduced in length or displaced from the axis. The response 
of the radar to volume scatterers of different shapes within the cylinder is 
extremely complex but such variations would be minor compared with the 
gross observational effects discussed by Forsyth (1960). For example, if the 
radio scatterers were disposed in arcs, like those of the optical aurora, the 
radar sensitivity would be relatively independent of the orientation of the 
arcs with respect to the radar. Thus, for the particular combination of pulse 
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length and antenna beam-width used, the effects of volume integration on the 
azimuthal distribution can be neglected. There remain the effects of geo- 
graphical distribution of aurora and of aspect sensitivity. 

The records consisted of photographs of the radar display, each showing the 
location (in azimuth angle and range) of all the echoes obtained in one azimuth 
sweep (scan) of the antenna at a constant elevation angle. The photographs 
were analyzed by recording the occurrence of echoes within 5-degree intervals 
for each antenna sweep, and deriving the azimuthal distribution of echoes for 
each elevation angle. These distributions were normalized to indicate occur- 
rence of echoes as a percentage of possible observations. Each value then 
represented the percentage of the scans for which echoes were observed (or 
deduced probability of echo occurrence, expressed in hundredths) within each 
5-degree azimuth sector for a given elevation angle. The range information 
on the original photographs was lost in this analysis. However, experience has 
shown that only rarely does the height of the echoing region depart significantly 
from 100 km so that the range information could be recovered simply by 
computing the range of intersection of the radar beam with the 100-km level 
for each elevation angle. In this way it was possible to compute the range and 
azimuthal distribution of the echoes. 

The original data were divided into two groups, corresponding respectively 
to the 6 hours before and the 6 hours after local midnight (7 hours U.T.). 
This division was made because it appeared from visual observation of the 
radar displays that the characteristics of the echoes observed in the early 
part of the night were qualitatively different from those observed in the later 
(early morning) hours. 


CONSIDERATIONS CONCERNING THE REFLECTION MECHANISM 

If it is assumed that the effects of volume integration are absent from the 
observations, then the principal factors which govern the azimuthal distri- 
butions should be the geographical distribution of aurora and aspect sensitivity. 
The geographical distribution of radio-aurora is not known with accuracy, 
but there is evidence that the distribution in geomagnetic latitude for v.h.f. 
radio-aurora is similar to that found by Vestine (1944) for optical aurora 
(Forsyth, Green, and Mah 1960). The anticipated effect of aspect sensitivity 
is to increase the occurrence of echoes in regions where the aspect angle (the 
angle between the direction of incidence of the radio waves and the direction 
of the geomagnetic field lines) is near 90°, and to decrease the occurrence as 
the aspect angle departs more and more from this value. 

The area at the 100-km level that is represented by a single block of radar 
data (referring to a specific elevation angle and a 5-degree azimuthal sector) 
varies with elevation angle, or range. It is more convenient to treat the data 
when expressed as occurrence per unit area. The observations were corrected 
for this area effect. To do this, each occurrence figure was multiplied by the 
ratio of the area illuminated by the radar beam at 1° elevation angle to that 
illuminated by the radar beam at the elevation angle concerned. 

A radar is inherently less sensitive to a target at longer ranges than to one 
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at shorter ranges. It is desirable in discussing the echo distribution to correct 
it for this range dependence. Unfortunately, the range dependence is, in turn, 
governed by the reflection mechanism and by the characteristics of the radar 
receiver and display. The nature of the relationship between target detect- 
ability and signal power has been discussed for a particular radar by McNamara 
(1960). He has shown that the relationship depends critically upon the dis- 
tribution of target cross sections and the recording characteristic of the radar. 
While the latter can be determined quite accurately, the former is in some 
doubt. From his discussion it seems that no great error will result if a linear 
relationship between echo power and echo detectability is used for the present 
purpose. Any departure from this linear relationship will lead to a change in 
the apparent range dependence of the echoes. In general, the range dependence 
of echo power varies with the size and shape of the scatterers. The echo power 
produced by a small isotropic scatterer varies inversely as the fourth power of 
range (R-‘). The echo power from a long, cylindrical (highly aspect sensitive) 
scatterer varies as R~ and for a large surface as R~*. A scatterer that is much 
smaller than the principal Fresnel zone (a few hundred meters for the u.h.f. 
radar) will exhibit an R~* dependence even when quite aspect sensitive. 


DISCUSSION 

It seems likely that both aspect sensitivity and geographical distribution 
affect the observations, but to differing degrees in the periods before and after 
midnight. The echo occurrences obtained in the earlier period (before midnight) 
are shown plotted as contours on a map of the area north of Prince Albert in 
Fig. 1(a) and (0) together with the position of the center of the auroral zone. 
Both of the echo distributions include the area correction. In addition, Fig. 
1(a) shows the echo distribution corrected for an R-* dependence, and Fig. 
1(b) the same data corrected for an R~* dependence. In Fig. 2, the echo distri- 
butions corresponding to the period after midnight are shown together with 
the contours of constant aspect angle. Again, in addition to the area correction, 
the data of Fig. 2(a) have been corrected for an R-* dependence and that of 
Fig. 2(b) for an R~* dependence. 

Both of the distributions of Fig. 1 show the same trend but the concentration 
of echoes is more marked in Fig. 1(b). The nature of the echoes in the earlier 
part of each auroral display (discrete, sharply defined echoes sometimes dis- 
tributed in the general shape of common auroral forms) suggests that the R~4 
range dependence should be appropriate. In either case there is a concentration 
of echoes at or near the range where the auroral zone cuts across the area of 
surveillance. Perhaps more significant is the extension of the contours to the 
north-west. The most rapid variation of echo occurrence takes place in a 
region where the aspect angle changes only by a fraction of a degree, whereas 
the slow variation of echo occurrence takes place in the region of most rapid 
change of aspect angle. It seems that aspect sensitivity plays only a secondary 
role in determining the echo distribution. It may also be noted that the 
variation of echo occurrence with geomagnetic latitude is much more rapid 
than that suggested by Vestine’s data or by the radio results obtained at lower 
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Fic. 1. Maps of the area to the north of the radar station, showing contours of constant 
echo occurrence for the period before local midnight. The observed echo occurrence figures 
have been corrected as described in the text. The range dependence assumed in Fig. 1(a) is 
inverse third power, and in Fig. 1(b) is inverse fourth power. The broken line indicates the 
approximate position of the center of the auroral zone. 


frequencies. This suggests that for the discrete echoes observed before mid- 
night, there is a tendency for the echoes to be confined more to the neighbor- 
hood of the auroral zone with increasing frequency. Such a tendency was 
found in the dual-frequency radar results obtained at Saskatoon by Currie, 
Forsyth, and Vawter (1953). 

In Fig. 2, a different situation is evident. The gradient of echo occurrence 
is generally similar to the gradient of aspect angle and, particularly in Fig. 
2(a), the shape of the aspect angle contours are in places quite accurately 
reproduced. It is difficult to choose between the two range dependences on 
any theoretical basis but the R-* dependence of Fig. 2(a) seems to provide the 
better fit. This may be due to partial filling of the radar beam rather than to 
the presence of very elongated scatterers. A somewhat better fit between the 
echo contours and the aspect angle contours would result if the former contours 
were rotated by about five degrees in the clockwise direction relative to the 
latter contours. The accuracy with which the angular data were obtained from 
the film does not preclude such an error but it is not clear how a systematic 
error of this magnitude could arise. In any case, the distribution seems to 
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Fic. 2. Maps corresponding to those of Fig. 1 for the period after local midnight. The 
broken lines represent contours of constant aspect angle. For reasons of clarity the aspect 
angle contours have been labelled only in Fig. 2(0). 


reveal little control due to geographical distribution, but is consistent with 
control by aspect sensitivity. 

It sometimes has been suggested that auroral reflections occur only for the 
specular condition (aspect angle, 90°) and that when high-latitude echoes are 
observed, the geomagnetic field lines are distorted sufficiently to satisfy this 
condition. The asymmetry of the aspect angle contours about the geomagnetic 
meridian is due to small, relatively local irregularities in the geomagnetic field 
as observed at the surface of the earth. This asymmetry is preserved in the 
echo contours of Fig. 2 (but not in those of Fig. 1). In order to fulfill the specular 
reflection condition in the neighborhood of the first and second contours the 
magnetic field direction would need to be depressed by five and six degrees 
respectively, presumably by strong currents flowing in or near the 100-km 
level. It seems hardly likely that the local irregularities of the surface field 
would be maintained to the 100-km level during such violent disturbances. It 
is more likely that for the reflections concerned the aspect angles are approxi- 
mately those indicated in Fig. 2, and that the observed echo distribution of 
Fig. 2 can be used to estimate the angular distribution of the scattered 


radiation. 
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The variation of backscattering cross section with aspect angle for small, 
weak, elongated scatterers having an axis ratio that is substantially greater 
than unity, is given approximately by 


a(0) < exp(—2k?L? cos? @) 


where & is the wave number (27/\), @ is the aspect angle, and L is a length 
parameter which corresponds to the physical length of a simple scatterer. 
The linear relationship between echo power and detectability, assumed earlier, 
may be invoked again in order to apply the cross-section expression to the 
contours of Fig. 2. It appears from that figure that an increase of aspect angle 
from 95° to 96° corresponds to a decrease in detectability of the auroral 
scatterers by a factor of about 2. Hence, 


In 2 = 2k*L?(cos? 96° —cos? 95°), 


which yields a value of L~1 meter. This is the length of the individual 
scatterer (measured along the magnetic field) in an assembly of identical 
scatterers which would produce a degree of aspect sensitivity equal to that 
observed. It is more realistic to expect that there will be a distribution in 
size of scatterers. This situation has been examined by Moorcroft (1960, 1961¢, 
19610), who has shown that the correct physical interpretation of the parameter 
L is complicated and differs for different distributions in size and in strength 
of the scatterers, and for different observing wavelengths. 

It may also be of interest to extrapolate the occurrence data to an aspect 
angle of 90°. From the cross-section expression, using the derived value of L, 
it follows that the cross section of the scattering region would increase by a 
factor of about 5 for a decrease in aspect angle from 95° to 90°. If this were 
associated with a corresponding increase in echo occurrence, then echoes of 
this type could be observed nearly continuously (during the appropriate 
hours) if observed at most favorable aspect angles. While the extrapolation of 
echo occurrence in this way is not likely to be valid, it does emphasize the 
fact that, because of aspect sensitivity, the observed echo strength (and hence, 
echo occurrence) for these echoes is substantially less than it would be if 
observed under more favorable geometrical conditions. 

The echoes observed after local midnight characteristically are of the 
‘‘diffuse’’ type described by Leadabrand (1961) and it is presumably to this 
type of echo that the preceding remarks apply. On the other hand, the echoes 
observed before local midnight characteristically are of the ‘‘discrete’”’ type 
and the aspect sensitivity of these scatterers seems to be less. A reduction in 
aspect sensitivity could result from a still smaller value of Z but the value of 
1 meter deduced for the diffuse scatterer is already of the order of the mean 
free path in the echoing region. The cross-section expression used earlier can 
be written more accurately for a spheroidal scatterer as 


a(6) « exp{ —2k*’a?(h? — 1)cos?6} 


where a is one of the two (equal) minor axes and h is the ratio of major to 
minor axis. The aspect sensitivity of large scatterers can be small provided 
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the axis ratio, h, is not much greater than unity. If the discrete echoes observed 
at u.h.f. arise from the same scatterers as those giving rise to v.h.f. discrete 
echoes, then the appropriate value of a is probably at least several meters. 

In any case, Table II shows that the relative occurrence of echoes after 
midnight was greater than the occurrence before midnight, by a factor of 
more than 3. If the same scattering regions had been observed at more favor- 
able aspect angles the ratio of relative occurrence in the two periods might 
have increased substantially. It seems likely that for observations where the 
specular reflection condition is fulfilled the diffuse post-midnight echoes would 
dominate the echo statistics. This seems to have happened for the observations 
of Fricker, Ingalls, Stone, and Wang (1957) which were made at Boston 
where the specular reflection condition can be fulfilled. At such a low latitude 
the occurrence of both discrete and diffuse scatterers should be considerably 
less than for the present observations but the detectability of the diffuse 
scatterers should be considerably greater. It is probably this combination of 
circumstances that has led some observers to the conclusion that virtually 
all of the observed scatterers are highly aspect sensitive. 

It is worth noting that while the ‘‘before midnight’ period nominally 
covered 6 hours, most of the echoes were actually observed in the 2 or 3 hours 
immediately before midnight. Since the number of occurrences is small, it is 
not possible to determine from the present data whether the echo distribution 
shown in Fig. 1 is characteristic of the time period (6 hours before midnight) 
or simply of the early part of an auroral display. Perhaps with more extensive 
data covering displays which started at various times, the particular charac- 
teristics noted would be submerged by those of the later, more numerous, 
echoes. The secondary concentration of echoes near maximum range in the 
neighborhood of true North is of particular interest. It is well established that 
in this region the western ends of auroral arcs (most of which occur in the 
period before midnight) tend to be directed well to the north of the line of 
local geomagnetic latitude (Jensen and Currie 1953). The secondary concen- 
tration of echo contours in Fig. 1 forms a ridge that is oriented approximately 
in the same direction as the auroral arcs. This direction is also approximately 
the direction of the locus of peak auroral activity found, for example, by 
Green (1961) and predicted by Axford and Hines (1961) for the period 
immediately before midnight. 
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TRANSMISSION AND REFLECTION OF ELECTROMAGNETIC 
WAVES AT A PLASMA BOUNDARY FOR ARBITRARY 
ANGLES OF INCIDENCE! 


K. A. GRAF AND M. P. BACHYNSKI 


ABSTRACT 


The interaction of a plane electromagnetic wave with a flat free-space — plasma 
interface has been considered for arbitrary angles of incidence. It is shown that 
the plasma can support independent horizontally and vertically polarized waves. 
Expressions and graphical representations are given showing the amount of 
energy entering the plasma as a function of angle of incidence and plasma para- 
meters. The vertically polarized case shows a maximum in the energy entering 
the plasma at the ‘Brewster angle’. For a lossy plasma, at this maximum, there 
will be reflection. Loci of constant Brewster angle appear as concentric curves 
centered on the origin of the complex dielectric coefficient plane. 

The elliptical polarization of a plane wave reflected from the interface, when 
a wave with equal horizontally and vertically polarized components is incident 
on the interface, suggests the similarity of lossless plasmas to ordinary dielectrics 
and of lossy plasmas to metals. 


I. INTRODUCTION 


The propagation of electromagnetic waves in unbounded uniform plasmas 
is well known (Ratcliffe 1958; Bachynski, Shkarofsky, and Johnston, in 
press). However, for plasmas of practical concern (the ionosphere, plasma 
sheaths surrounding hypervelocity vehicles, laboratory plasmas, etc.), the 
interaction of electromagnetic waves is determined not only by the physical 
properties of the plasma but by the shape and nature of the boundaries as 
well. The reflection and transmission of electromagnetic waves at a plasma — 
free-space interface has not been treated extensively except for special cases 
of lossless plasmas or normal angles of incidence. Both of these cases lead to 
plane wave solutions which are not valid for lossy plasmas when the r-f. 
energy is incident at an angle of incidence other than normal. 

In this paper, the transmission and reflection of energy that result when a 
plane electromagnetic wave is incident at an arbitrary angle on a flat plasma — 
free-space boundary is considered. By describing the plasma in terms of an 
effective dielectric coefficient, relatively simple expressions for the electro- 
magnetic fields in free-space and in the plasma can be determined from 
Maxwell’s equations. Application of the boundary conditions then specifies 
the amplitude and phase of the reflected and refracted fields. The analysis 
is greatly simplified by writing the boundary conditions in terms of the fields 
in the plasma. With such an approach, the introduction of complex angle 
notation, as used by Stratton (1941) for lossy media, is unnecessary. In the 
limiting case of a lossless plasma or normal incidence, the expressions derived 
here yield the familiar Fresnel formulas. 


1Manuscript received August 9, 1961. 
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A number of specific cases for the energy entering the plasma as a function 
of angle of incidence have been evaluated with the aid of an IBM 650 com- 
puter. The variation of the Brewster angle with plasma parameters is con- 
sidered. In addition, the elliptical polarization of a reflected plane wave, when 
the incident wave has equal components both in the plane of incidence and 
perpendicular to the plane of incidence, is derived and shown to be a possible 
parameter for a radio-frequency diagnostic technique. 


Il. ELECTROMAGNETIC FIELDS IN A UNIFORM ISOTROPIC PLASMA 


Consider a sharp, plane, free-space —- plasma boundary irradiated with a 
plane electromagnetic wave. The Poynting vector of the plane incident wave 
makes an arbitrary angle @ with the normal to the interface. The plasma is 
considered to be neutral, uniform, and isotropic and to have electrical pro- 
perties which can be described in terms of an effective dielectric coefficient. 
This assumes the plasma properties to be adequately described by the 
number density of electrons and their effective frequency of collision with the 
heavier species of the plasma. The motion of the less mobile ions is neglected, 
although the ion effects can be readily included by using the appropriate 
expression for the dielectric coefficient. In addition, the field strength of the 
incident electromagnetic wave is considered small enough so as not to alter 
the plasma properties. That is, any bunching of electric charge due to inhomo- 
geneous waves launched in the plasma is neglected. 


II.a. Maxwell's Equations for a Plasma 

In a uniform neutral isotropic plasma, where the permittivity (€9) and 
permeability (uo) are the same as for free-space, Maxwell’s equations for a 
harmonically varying field («') may be written, using rationalized m.k.s. 
units, as: 


(1a) V-E = p/e0, 

(10) V-H = 0, 

(1c) VXE = —jwuoH, 
(1d) VXH = J+jewE, 


where E and H are the electric and magnetic fields, respectively, 

p is the net charge per unit volume, 

J is the current density, 

w is the frequency of the electromagnetic wave, and 

jisv-—l. 
The plasma can be considered as a ‘‘dielectric’’ described by the complex 
dielectric coefficient K. For a uniform, isotropic plasma (Bachynski, Johnston, 
and Shkarofsky 1960) 


“ b cae Wp ; x fe fi | a. 
oe [1-(*) Ore =) il (*) al 
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(K, will be less than unity; K, will be greater than zero) 
where w, is the plasma frequency, given by w) = +~/ne?/meo, 
n is the number density of electrons, 
—e,m are the electronic charge and mass, respectively, 
vis the effective collision frequency of an ‘‘average”’ electron. 
With such a dielectric coefficient, the electric displacement (D) becomes 


D = «KE. 
Maxwell’s equations (la and 1d) can then be written 
(2a) V-KE = 0, 
(20) VXH = jweoKE. 


For a uniform, isotropic plasma, VK is zero in the absence of an electro- 
magnetic wave. Since we have assumed that the wave is weak enough so 
that at all times VK is negligibly small, equation (2a) becomes 


(2c) V-E = 0. 


Maxwell’s equations give the Helmholz vector equation for the propagation 
of electromagnetic waves, namely 
(3) VE+R°KE = 0. 

The wave equation for the magnetic field is similar. 

It is convenient to find a solution for the Helmholz equations before obtain- 
ing a solution for Maxwell’s equations. Maxwell’s equations must, of course, 
be used to establish the magnitude and phase relations between the electric 
and the magnetic fields. 


II.b. Waves in a Bounded Plasma 
The co-ordinates used in the succeeding discussion are shown in Fig. 1. 
The electromagnetic fields in the plasma must be related to the fields on the 


x y is out of paper 
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Fic. 1. Co-ordinates of free-space — plasma interface. 
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interface. This imposes the restrictions that the fields must not vary as a 
function of y (i.e. d/dy = 0). As well, the phase velocity of the wave in the 
z direction (V,) must be the same as the phase velocity in the z direction at 
the interface. The second condition gives 


0/dz = —]jé 


where 6 = ksin @, 
k = 2n/\ = free-space wavenumber, and 
d is the free-space wavelength. 
With these two conditions, the Helmholz equation becomes (for the E field) 
OE. a 
(4) —=t-—8E, = —kKE, 
oh 
where n refers to any component x, y, or z. The solution of the Helmholz 
equation is 


(5) EE, = En ePtt+I(Qr—5z+w!) 
1 


where the £,,’s are amplitudes of the components of the field at the interface, 
and may be complex, indicating phase differences between the various compo- 
nents. P and Q are real and positive, corresponding to waves moving in the 
—x direction, that is, away from the interface into the plasma. 

P corresponds to an attenuation coefficient; Q corresponds to a phase 
coefficient. It should be noted that P and Q specify attenuation and phase 
along the x-axis (i.e. normal to the interface) rather than in a direction normal 
to the surfaces of constant phase, or in the direction of the Poynting vector. 
P and Q are given by 


(6a) P = k{44/[(Kr—sin’ 0)’4+Kj]—4(K,—sin’ 6)}}, 
(6b) Q = k{3\V/[(Kr—sin® 6)?+Ki]+4(Kr—sin’ 6)}}. 

Equation 5 shows that the phase velocity in the z direction is w/6; the 
phase velocity in the x direction is —w/Q. The normal to planes of constant 


phase makes an angle of ¥ with the normal to the interface, where y is given 
by 


(7) tany = ksin6/Q. 


Equation 5 also indicates that planes of constant amplitude are parallel to 
the interface (see Fig. 2). 

The normalized attenuation and phase constants in the x direction (P/k and 
Q/k), for various values of K, are shown in Figs. 3a and 30. For the lossless 
case, either P/k or Q/k will be equal to zero depending on whether or not the 
wave is ‘“‘cut off’. When the wave can propagate in the plasma, P/k is zero, 
indicating no attenuation. When the wave is cut off, P/k corresponds to the 
attenuation of the fields of the Sommerfeld solution (see Section III.e). Both 
P/k and Q/k are zero at the critical angle 6,, the smallest angle at which 
total reflection occurs. 
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Fic. 2. Planes of constant amplitude and constant phase in the plasma. 


When there is loss (K, > 0), neither P/k nor Q/k will be zero. It can be 
seen that as the plasma becomes more lossy (KA; increases) the attenuation 
increases (P/k increases). The penetration depth or skin depth, the depth at 
which the amplitude of the envelope of the vector fields has decreased to 1/e 
of the surface value, is given by 


d, = 1/P. 


Figure 3a shows a d, scale as well asa linear P/k scale. Note that an attenuation 
constant of two, corresponds to a skin depth of about 1/12th of a wavelength. 


III. TRANSMISSION AND REFLECTION AT A PLASMA BOUNDARY 

Since Maxwell’s equations show that E,, H,, and H, are independent of 
E,, E,, and H,, it is convenient to consider two separate waves in the plasma. 
These waves will be referred to as horizontally polarized (designated by 
subscript ‘‘h’’), and vertically polarized (designated by subscript ‘‘v’’). A 
wave with the electric vector perpendicular to the plane of incidence (the 
x—z plane) is referred to as horizontally polarized; a wave with the electric 
vector parallel to the plane of incidence is referred to as vertically polarized. 
Any incident wave can, of course, be resolved into these two components. 


IIT.a. Boundary Conditions 

(t) Horizontal Polarization 

The boundary conditions that must be satisfied specify that the electric 
displacement normal to the interface, the electric field tangential to the 
interface, and the magnetic field at the interface must all be continuous across 
the boundary. 

For a horizontally polarized incident wave we have 


E, = E, = 0; H, = 0. 
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The boundary conditions give 


(8a) H,—Hs = H,/cos @, 
(8b) H,+H, = H,/sin 6, 
(8c) E, +Epz, = —E,. 


The subscripts A and B refer to “incident” and “‘reflected’’ plane waves, 
respectively. Fields so designated are on the free-space side of the interface. 
The components not designated A or B are on the plasma side of the interface. 

The relation of the refracted wave to the incident and reflected plane waves, 
written in terms of the x component of the incident magnetic vector (H,), is 





a ae ) 
H, 2\sin@ ' ksin@cos@ Jk sin 6 cos 6/’ 








S8. Decal eee ae ) 
H, 2\sin@ ksin@cos@ Jb sin 0 cos 6/’ 
Hp _ cos 0—(Q/k)+j(P/k) 
Hy, cos6+(Q/k)—j(P/k)’ 


where R, is the reflection coefficient for a horizontally polarized incident plane 
wave. (Reflection coefficients refer to the relative amplitude of the fields.) 





(9) R, = 


(11) Vertical Polarization 


For a vertically polarized incident wave 
E, = 0; H, = H, = 0. 


The boundary conditions give 


(10a) D,+D, = D,/sin 6, 
(100) E,—E, = E,/cos 8, 
(10c) H,+Hgs = H,. 


When the electric field has a component in the direction perpendicular to 
the plasma boundary, the field will cause free charges to move at right angles 
to the plane of the interface. Consideration of either a convective current or 
a polarization shows that there will be surface charge on the interface (q,), 
given by 

de = e(K—1)E,. 


Equation 10a takes this charge into account since the displacement includes 
the polarization. In terms of the electric field, the first boundary condition 
(equation 10a) can be written 


(11) E,+E,;=KE,/sin 0. 


The relation of the refracted wave to the incident and reflected plane 
waves, written in terms of £,, is 
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a. Gate —| | K,cos0+~])—j\ K,ic —F I, 
E,  2sin@cos@ eo +; J Cos +7, 


Es, 1 , ° > Pp 

z san bcasal (K+ c08 0-2) ~i( cose) | 

(12) ae Ex _ [K, cos @—(Q/k)]—jlK; cos 0@— (P/k)] 
“Ex,  [K, cos 0+(Q/k)]—j[K, cos 0+ (P/k)]’ 


where R, is the reflection coefficient for a vertically polarized incident plane 
wave. 
III.b. Amplitude and Phase of Reflected Wave 


(1) Horizontal Polarization 
The reflection coefficient may be written 











Ry = |Rnle?*™ ae Ry, +jRn- 


The phase shift on reflection (4,) is given by 





if Rn ) a | 2(P/k)cos 6 | 
= — = lé , my - ae - 5 |. 
(13) A, = tan Ge tan” | S39 (O/k)?— (P/ by? 
The amplitude of the reflected signal is given by 


, _) beat T 
(14) [Ry | seat (Rr Rn) bac [feos 6+(O/k)P+(P/k) 


where the asterisk denotes the complex conjugate. 
(11) Vertical Polarization 
For vertical polarization the phase shift on reflection (A,) is given by 
-1J__2[K.(P/k)—K,(Q/k)]cos6__\ 


(15) Ay = tan “(KEK )cos’ 0— (Q/k)?— (P/b's 





The amplitude of the reflected signal is given by 


2 = *d _ = 
(16) [Rv] = (RvRy)" = ( 6+ (O/k)]°+[K, cos 0+ (P/k)] 





[(Kr cos 6— (Q/k)]"+[K 1 cos 0— (J Ley V 


III.c. Reflected and Refracted Power 


(1) Horizontal Polarization 

The incident, refracted, and reflected waves satisfy the conservation of 
energy principle at all times, namely 

(S,—Ss)cos 0 = S_», 
where S_, is the Poynting vector in the negative x direction (into the plasma). 
Normalizing the incident power to unity allows one to write the time average 
energy as 
& 
i— RyRy - En 
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NORMALIZED REFRACTED POWER 
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where &, is the normalized average refracted energy given by 


(17) a eae 4(Q/k) cos@ | 
" [cos 0+ (Q/k)]°+ (P/k)? 





and R,Rx is the normalized average reflected energy given by 


_ [cos @— (Q/k)|°-+ (P/k)? 
[cos 0+ (O/k) |? +(P/k)° * 








(18) R,Re 


(ii) Vertical Polarization 

For vertical polarization the incident, refracted, and reflected waves must 
also satisfy the conservation of energy principle at all times. Normalizing the 
incident power to unity allows one to write the time average power as 


1—R,Ry = &. 


The normalized average refracted energy (&) is given by 
(19) baa teiae 4 cos OK .(QO/k)+Ki(P/k)] _ ’ 
¥  [K, cos 6+ (Q/k)]°+[K, cos 0+ (P/k)]° * 


For a plasma, K,(Q/k)+K,(P/k) is always positive. Therefore, & is always 
positive. The normalized average reflected energy (R,R¥) is given by 


(20 R.R* = Ey _ [K, cos 6—(Q/k) |’ +[K, cos 0@— (P/k)]” 
oe sa Ex [K, cos 6+ (Q0/k)|°"+K, cos 6+(P/k)|*° 





(iti) Variation of Refracted Power with Angle of Incidence 

The variation of the normalized average refracted energy (& and &) asa 
function of angle of incidence is shown in Fig. 4(a, b,c, and d). Figure 4a 
shows the results for a lossless plasma. Because the plasma “‘dielectric con- 
stant’”’ has values which are less than unity, the curves show a cutoff which is 
not normally encountered on the free-space side of a dielectric boundary. 

When the plasma is slightly lossy (K, = 0.06, Fig. 4b) the cutoff which is 
present in the lossless case disappears. The significance of cutoff becomes 
less meaningful as the loss increases. The effect of larger loss (K, = 1.00; 
K, = 3.00) is shown in Figs. 4c and 4d. 

Figure 5 shows the variation of & and & for various amounts of loss (that 
is, K, = 0.2 and various values of K,). The absence of ‘‘total reflection’”’ and 
the reduction of the maximum of refracted power is apparent. 


III.d. Energy Flow 
(t) Horizontal Polarization 
The Poynting vector in the plasma is 


‘ 12 Zo good e Q 2 P . 2 l 

(21) S, = |H,|"=—« ) —i\ < cos wt+— cos wt sin wt )-+k cos wt 
sin 6 E 6 6 

where i, k are unit vectors in the x, z direction respectively. The average 

over one period at the interface (x = 0) is given by 
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2 
29 § = Bel Zo )_,Q \ 
a) a Sane Cs Oe 
The average energy thus enters the plasma at an angle Vs, given by 
(23) tan Ws, = 6/Q. 


Equation 7 showed that tan y = 6/Q. Therefore, energy, on the average, 
travels at right angles to the planes of constant phase. That is §, has the 


NORMALIZED REFRACTED POWER 
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Fic. 5. The power entering various plasmas for a horizontally polarized and vertically 
polarized incident wave, as a function of angle of incidence @. As the plasma becomes more 
lossy (as Kj increases), the cutoff disappears and the maximum at the Brewster angle decreases. 


same direction as the normal to the surfaces of constant phase (Fig. 6a). 
Equation 21 shows that the Poynting vector changes direction, as well as 
magnitude, during one period. That is, the wave is not a plane wave. 

(it) Vertical Polarization 

The Poynting vector in the plasma is given by 


lz |2 2Pr 
|B, € 


=F5 ; sn? pt HlOK: cos’ wt+PK, sin® wt 
Ao Ss 


(24) S, 


+ (QK,+PK,)cos wt sin wt] +k(K, 6 cos’ wt-+K, 8 sin wt cos wt) }. 
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FG. 6a. Direction of energy travel in a plasma for a horizontally polarized incident plane 
wave. 
FG. 6b. Direction of energy travel in a plasma for a vertically polarized incident plane wave. 


The Poynting vector at the interface (x = 0), averaged over one period, is 


E,|" 


(95 sS -_ z : 
my Sy 2 Zk sin’ 


{—i(OQK,+PK,)+kéK,}. 


The average energy enters the plasma at an angle ps, given by 


6K, k sin 6 
OK,+PK, O+P(K,/K;)’ 


(26) tan Vs, = 


From equations 7 and 26 it can be seen that, in general, ~g, does not equal 
y. That is, the average Poynting vector is generally not normal to surfaces 
of constant phase. Since OK,+PK, is always positive, pg, is negative if A, 
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is negative, corresponding to a ‘‘backward”’ wave. When &K, is positive, 
P(K,/K,) is positive and Wz, is less than y. When K, is zero, there is no 
loss and if P is zero (i.e. the wave is not cut off) ws; = y. In the last case, 
the wave is a plane wave and the Poynting vector is normal to surfaces of 
constant phase. These effects are illustrated in Fig. 6b. The waves in the plasma 
become plane waves if there is no loss (K, = 0) or if the angle of incidence 
is zero (6 = 0). In these cases, the familiar Fresnel equations are obtained 


from equations 18-20. 
IIl.e. Cutoff and Total Reflection 

When there is no loss (K, = 0), either Q or P will be zero depending on 
whether K, is less than or greater than sin? 6, respectively. (When K, = 0 
and K, = sin? 6, both P and Q will be zero.) When Q is zero the wave is 
cut off. There will be fields inside the plasma given by 


+ x Pr+j(—é2z4+w!) 
- = ie vy 


These fields are quite commonly referred to as the Sommerfeld solution 
(Sommerfeld 1954). The Poynting vectors for the two polarizations are 


. 2 2P 
es 


(27a) Ss, = 7 tas { —-iPK, cos wt sin w&t+kK,6 cos’ wt}, 
ee os : 
(276) S, = rs —" {iP cos wt sin wt+ké cos’ w}. 


When averaged over one period we have 


= Eee 
28a) ,= kiss 
— S, 2Z sin 6 


(28b) S, = Zee. 
2 sin @ 

Since energy flows, on the average, only in the s direction, we have S_, = 0, 
and there is “total reflection’’. For total reflection, it is clear that, in general 
(equations 27), energy does flow normal to the interface inside the plasma. 
This ‘storing of energy" is necessary to satisfy the conservation of energy 
principle at total reflection since, in general, there will be a phase shift when 
there is total reflection. 

There is total reflection for angles of incidence which are larger than or 
as large as the critical angle of incidence. The critical angle of incidence (4) 
is given by 

= sin Viy Ey. 


When K, is not zero, the fields inside the plasma not only store energy 
but also transmit and dissipate energy. Neither P nor Q will be equal to 
zero if there is loss. That is, when the plasma is lossy, the incident wave 


will never be totally reflected. 
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IV. BREWSTER ANGLE PHENOMENA 
Complete transmission of r-f. energy into a plasma occurs when RR* is 
equal to zero. For a vertically polarized electric field this occurs at an angle 
of incidence given by 
P 
cos 6 = A. ahha sgt 
RK, kK, 
This relation has no valid solution except when the plasma is lossless. For 
a lossless plasma this corresponds to the Brewster angle (6s,) given by 
a 


(29) tan Oa; '= «/K;; 


For a lossy plasma, the transmission is not complete but corresponding 
maxima in the refracted power occur (see Figs. 4 and 5). One can therefore 
extend the definition of the Brewster angle to indicate the angle of incidence 
for which the reflected power in a vertically polarized wave is a minimum. 
No such angle occurs for a horizontally polarized reflected wave (except at 
normal incidence). 

The Brewster angle for a lossy plasma could be found by differentiating 
& (equation 19) with respect to 6 and setting the result equal to zero. In 
practice, such a procedure is cumbersome since P/k and Q/k are given by 
rather involved expressions. Since & was evaluated on a digital computer, 
it was simpler to evaluate £, for a sufficient number of values of the dielectric 
coefficient so as to indicate the variation of the Brewster angle and the 
amplitude at the Brewster angle. A plot of these parameters, on the complex 
dielectric coefficient plane, or K-plane, is shown in Fig. 7. 

Lines of constant Brewster angle are almost concentric circles centered on 
the K-plane origin. The Brewster angle decreases to zero as the origin is 
approached. As the distance from the origin increases, corresponding to larger 
absolute values of the dielectric coefficient, the Brewster angle occurs at 
larger angles and approaches 90° as the dielectric coefficient becomes infinitely 
large. Beyond the region shown, the dependence of the Brewster angle on 
the distance from the origin can be approximately determined by calculating 
the Brewster angle for a lossless plasma with the same magnitude of K. On 
the positive K,-axis, the Brewster angle is given by the simple relation for 
a lossless plasma, given by equation 29. 

Figure 7 also shows the magnitude of the refracted energy at the Brewster 
angle and at normal incidence. This allows a comparison of the magnitude 
of the refracted power at the Brewster angle with the magnitude at normal 
incidence. For example, if K, = —1.0 and K, = 1.0, the Brewster angle will 
be about 43°; the magnitude of &, at the Brewster angle will be approximately 
0.62; and the magnitude of & at normal incidence will be 0.55. 

In practice, one is interested in determining the variation of reflected and 
refracted energy in terms of the plasma parameters: plasma frequency (w,), 
collision frequency (v), and the radio frequency (w) of the incident wave. The 
variation of these normalized parameters S = v/w, N = w/w, C = v/up, 
F = w/w, on the dielectric coefficient plane is shown by Bachynski, Johnston, 
and Shkarofsky (1960). 
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Lines of constant Brewster angle are shown on the complex “dielectric” coefficient 


Fic. 7. 
plane. The amplitude of the normalized refracted power (&) at the Brewster angle and at 


normal incidence is also shown. 


V. POLARIZATION OF REFLECTED WAVE 

When a horizontally or vertically polarized wave is incident on a plasma 
boundary, the intensity of the reflected signal is easily measured, but phase 
change is not readily measured. If the incident signal has a horizontally 
polarized component and a vertically polarized component, the difference in 
the phase change between these two components results in a change in the 
elliptical polarization of the reflected wave which is relatively easy to measure 
and yields the same information regarding the plasma properties as trans- 
mission measurements do. 

The elliptical polarization of a wave can be specified by the complex 
quantity p which is related to the electric vectors parallel to and perpendicular 
to the plane of incidence (£,, Ey) by 








en reer 
= . = . io 0 

“hn Ey 
where the magnitude of p is po = |E,/F,|, and EF, leads FE, in phase by an 


angle A. 
For an incident wave of polarization given by 


A 
Pa = pae “A = 1, 
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the reflected wave is in general elliptically polarized with ellipticity given 
by 


fa = pope’ 8, 


p_ 4/ Ry "= / (1=®) 
B Re 1—§/° 


Ag = 4,—An+A,g 


where 


(30) Pop = 








where A, and A, are given by equations 13 and 15. That is, 





| ee ee ae) 


where 


a = [K, cos 0+(Q/k) ][cos @— (Q/k)]+[K, cos 0+ (P/k)](P/k), 
b = [K, cos 6+ (Q/k)](P/k) — [Ky cos 6+ (P/k)][cos 6— (Q/k)], 
c = [K, cos @— (Q/k)][cos 6+ (Q/k)]—[K1 cos @— (P/k)](P/k), 
d = —[K, cos 0—(Q/k)](P/k) —[K, cos 6— (P/k)][cos 6+ (Q/k)]. 


The expression (ad—bc) has the same polarity as p,; (ac+dd) has the same 
polarity as p,. With attention to the polarity of p, and p;, equation 31 specifies 
Az between 0 and 360°. 

The variation of pop and Ag for py = 1 is shown in Fig. 8. The effect of 
cutoff and of the Brewster angle can be seen for the lossless case. Beyond 
cutoff, where there is total reflection, pop; = 1. At normal incidence, the phase 
change is 180° (both electric components are reflected identically but the 
wave travels in the opposite direction). At the Brewster angle, the phase 
change drops to zero and stays at zero until cutoff. Beyond cutoff, the phase 
shift for the E, and £, components will be different. Ag increases and drops 
to zero again at grazing. When K, increases, the minimum value of po at the 
Brewster angle is greater than zero; the phase change becomes more and 
more gradual. For large loss, pp and Ag values become similar to those de- 
scribing metals (Born and Wolf 1959). 


VI. SUMMARY 

A plane electromagnetic wave horizontally or vertically polarized incident 
at an arbitrary angle on a flat plasma boundary can launch a horizontally or 
vertically polarized electromagnetic wave, respectively, into a uniform, iso- 
tropic plasma. The reflected wave will be plane but the wave in the plasma 
will not, in general, be plane. 

By specifying the attenuation and phase constants in the direction normal 
to the interface, rather than in the direction of the Poynting vector or the 
normal to surfaces of constant phase, the equations for the fields retain a 
simple form. 
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Numerical results showing the variation of attenuation and phase constants, 
power entering the plasma, and ellipticity of the reflected wave are presented. 
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GENERAL FORMULA FOR THE SECOND VIRIAL COEFFICIENT! 


J. VAN KRANENDONK 


ABSTRACT 

A simple derivation is given of the quantum mechanical expression for the 
second virial coefficient in terms of the scattering phase shifts. The derivation 
does not require the introduction of a quantization volume and is based on the 
identity R(z)—Ro(z) = Ro(z)HiR(z), where Ro(s) and R(z) are the resolvent 
operators corresponding to the unperturbed and total Hamiltonians Hp and 
Ho+H; respectively. The derivation is valid in particular for a gas of excitons 
in a crystal for which the shape of the waves describing the relative motion of 
two excitons is not spherical, and, in general, varies with varying energy. The 
validity of the phase shift formula is demonstrated explicitly for this case by 
considering a quantization volume with a boundary the shape of which varies 
with the energy in such a way that for each energy the boundary is a surface of 
constant phase. The density of states prescribed by the phase shift formula is 
shown to result if the enclosed volume is required to be the same for all energies. 


1, INTRODUCTION 


The correct quantum mechanical expression for the second virial coefficient 
of a gas of molecules with central intermolecular forces was derived by Uhlen- 
beck and Beth (1936). These authors showed that the second virial coefficient 
can be expressed in terms of the scattering phase shifts characterizing the 
scattering of two molecules by the intermolecular forces. The contribution to 
the second virial coefficient of the center of mass motion of the two molecules 
can always easily be taken into account, and we consider only the contribution 
from the relative motion. This contribution is proportional to the quantity 


(1) a = lim (Z—Zp), 
Voao 

where Z is the partition function of a particle of mass equal to the reduced 
mass of the pair of molecules moving in the space of the relative co-ordinates, 
in a total volume V, at the origin of which there is a potential field, 1), equal 
to the intermolecular potential between two molecules; Z» is the value of Z 
for H, = 0, i.e. in the absence of the interaction. For the sake of simplicity, 
we assume that there are no bound states of the pair of molecules. For a 
spherically symmetric potential H,, the quantity (1) is then given by 


oo sz 1 dni 
2) a= | dE ¢& ** — = 
‘ ; 7 te dE’ 
where 6B = (kT)~', and nm™(E) is the phase shift suffered by the partial wave 
of angular momentum /,m and energy E upon scattering in the potential 
field H;. 

In recent years a number of rederivations and generalizations of the result 
(2) have appeared in the literature. Fukuda and Newton (1956) discuss the 
spherically symmetric case and show that the shift in the energy of a level 

‘Manuscript received August 9, 1961. 
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belonging to the quasi-continuous spectrum of Ho, resulting when the inter- 
action H, is switched on adiabatically, behaves asymptotically for V-—> © as 
E*nim(E)V~. From this fact the relation (2) follows immediately. DeWitt 
(1956) gives a general derivation of (2) using formal scattering theory. His 
result is valid for an arbitrary interaction H, and can be written in the form 


—-{agr-#iy &™ 
(3) a= |dEe 7 GR? 


where E, \ are the quantum numbers diagonalizing the S-matrix (cf. Lippmann 
and Schwinger 1950). Finally, Goldberger (1959) has given a derivation of 
(2) using a method originally introduced by Wentzel (1942). Goldberger only 
gives the derivation for the spherically symmetric case (2); it is not obvious 
how his method can be used to derive the formula (3) for the general case. 

All these derivations have in common that, at some stage in the calculation, 
the discrete, quasi-continuous, spectrum is considered explicitly, and that the 
limit V — © involved in (1) is then taken. This transition from the quasi- 
continuous to the continuous spectrum must be handled with care. Reifman, 
DeWitt, and Newton (1956), for instance, were led to the incorrect result 
that the level shift is proportional to tan m rather than to » by assuming that 
a perturbed level is surrounded symmetrically by unperturbed levels (cf. 
Fukuda and Newton 1956). 

The purpose of the present paper is to give an alternative derivation of the 
general expression (3) based on the identity (6). In this derivation the transition 
to the continuous spectrum is made at the start, thus avoiding the necessity 
of introducing a quantization volume, and of discussing explicitly the shifts 
of individual levels. We have in mind, in particular, the application of (3) to 
the calculation of the second virial coefficient of a gas of excitons in a crystal 
(Van Kranendonk 1956). Such quasi particles move in a discrete rather than 
a continuous space, and the kinetic energy operator, Ho, of the relative motion 
of two excitons depends on the center of mass motion. The expression (3), 
however, is still valid, except that the integration now extends over a finite 
energy interval corresponding to the finite exciton energy band. The difficulty 
in deriving (3) for this case in the usual way, by considering the separation 
between consecutive energy levels with and without the interaction, in a large 
volume V on the boundary of which the wave function is required to vanish, 
originates from the fact that the shape of surfaces of constant phase of out- 
going scattered waves is not spherical and in general varies with varying 
energy. Starting with an eigenfunction vanishing on a given surface, an increase 
in the energy produces a wave function which vanishes on a surface of different 
shape, and it is not immediately obvious that in this case (3) is still valid. 
In Section 3 we discuss this question in some detail, whereas in Section 2 we 
present the formal derivation of (3). 


2. FORMAL DERIVATION OF THE PHASE SHIFT FORMULA 
The quantity @ defined by (1) is equal to 


~B(Ho+H1) 


e 


(4) a = lim try[e ome 


Viw 


’ 
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where try indicates a trace over a complete set of states in the volume V. 
Using Cauchy’s theorem, the expression (4) can be written in the form 
5 Ss in al i : 
(5) a = lim =— try { dze **[R(z) —Ro(z)], 
Cc 


Visa 2m 
where R(z) = (2—H)- and Ro(z) = (z—Ho)-' are the resolvent operators 
corresponding to H = Ho+H, and Hp respectively, and C is a path in the 
complex energy plane encircling the real positive axis in counterclockwise 
sense. The essential step in the derivation of (3) is to use the well-known 
identity 


(6) R(z)—Ro(z) = Ro(s)AiR(z). 


In virtue of the appearance of the potential H, of finite range in the right-hand 
side of (6), it is clear that after substituting (6) into (5) the limit V — © can 
be carried out, the trace being converted into a trace over a complete set of 
states belonging to the continuous spectrum, normalized according to (8). 
Denoting this trace by tr, we get 


(7) a= a J ae tr[Ro(z)HiR(sz)]. 


The trace may be written out in terms of the complete set of eigenstates of 
Hy defined by the equations 


| Hodm = Edn, 
(8) | (be'x's Ger) = bn'6(E—E’), 


| x J dE dex)(oex = 1. 


We assume that the ¢g, diagonalize the S-matrix, 
{ + ~y a + 
(en's Shen) = dy’ 6(E—E’) S\(E), 

(9) oe 
l@ ale 
| Sx(E) = exp [2rin,(E)]. 

The caret will be used to denote matrix elements on the energy shell. The 

states dg, are linear combinations of the states x, cf. (40), where the ¢, are 

plane wave states for ordinary particles and Bloch state for excitons. 

For the sake of simplicity we assume that we are dealing with a separable 
scattering problem, i.e. that S and H, are simultaneously diagonal in A, so 
that in addition to (9), we have 


(10) (ben, Hider) = dr (ber, Hider). 


We can then work entirely within a subspace corresponding to a given value 
of \. Using the notation 


(11) > = opr, ¢ = orn, etc. 


the contribution of the subspace A to (7) can be written in the form 
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9) — tf a | ’ 1 ’ , 
(12) a= 5 _ ue dE ie (¢’, Hi R(z) ¢’). 


To remove the operator R(z) from (12), we need a complete set of eigenstates 
of H. For these states we use the outgoing scattering states, y,, defined by the 
integral equation (Lippmann and Schwinger 1950), 


1 
(13) i= $+ iH 
where ¥, = W%.. In the limit « — 0 and in the absence of bound states, these 


states satisfy die following relations (Gell-Mann and Goldberger 1953), 
Hyvex = Ever, 
| Wain, Wan’) = b8(E—B’), 
J ae yenyse = 1. 
Using the states Y4 in sda we get 


(15) a= as dze * { favar EG hie 2 Fx 7 (o's Aww! )(vy, ¢'). 


By choosing the path C to encircle arbitrarily closely the relevant part of the 


(14) 


FE-axis, we get 


a= az ee J fawanrve, E"", E)(¢’, HW), ¢’), 
(16) ; 


| D(E', E", E) = —+ ((E'—E)+e(E"—E) 


mw ((E'—E) +e ][(E”—E)* +e] ° 
The limit « > 0 is to be taken after the limit «’ — 0 involved in the definition 
of the states p,. 

We must now show that the expression (16) for a agrees with the expression 
(3), or rather with the contribution of the subspace X to (3), 


- _ tage ee tee 
(17) a= fake a dE’ 


where 7 = (EZ). We shall show, in fact, that the integrands appearing in (16) 
and (17) are equal, i.e. that the following identity holds, 





(18) - dE'dE"D(E’, E"", E)(¢', Hw), 6’) 


= Wes i De at 
“it (1+2ri7 ) aE? 


where 7 is the scattering operator defined by S = 1—T7, and where 


S = 1—2ziT, with 
(19) T = TM E) = , HW), 
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cf. Lippmann and Schwinger (1950). We first eliminate the quantity (y’/, ¢’) 
from (18). From the definition (13) of ¥4, we get 


(20) Wi, 6!) = 8(B"—E") +97 (8, Hat) 


When we substitute the first term of (20) into the left-hand side of (18), and 
make use of the relation 


/ e vw Dp 0 , 
(21) DE’, E',E) = sp Greta gp ME'-E), 


we get 


(22) Sf dE‘dE"D(E’, E”, E)(¢’, Hiv{)é(E’— E”) 
= ae 
dE’ 
which agrees with the first term on the right-hand side of (18). In the same 
way, using (21), one can show that the imaginary part of the term resulting 
when the second term of (20) is substituted into the left-hand side of (18) is 
equal to the imaginary part of the second term on the right-hand side of (18). 
To show that the real parts of these terms are also equal, and thus to establish 
(18), is more difficult but can be done by using essentially only the identity (6). 
The relation to be established may be written in che following form, 


= faviey ~ 7? 8(E"— 





2° a e dk" — e(E’ ee a F’-E" ; > 
(3) r=+f faeae ((z” —EP+e(z’ —E)'+e] (e'— —E mary nl (E’, E") 


= ir(7*aT—Ta7*), 
where we have introduced the notation 
(24) S(E’, BE") = |", AW) )?-|@, AW’) |*, 


and @ = (0/d£). In the Appendix we show that instead of taking the limit 
e’ — 0 followed by «— 0, as implied in (23), one may put «’ = e, and that 
one gets the correct result by using indiscriminately the singular functions 
6(Z) and A(1/E). Since f(E’,E”) defined by (24) is a smooth function of 
Kk’ and E” satisfying f(E’,E’) = 0, we get, in this way, 


(25) J 


Il 


; , _e ” “tr _ xP xR , tt 

J J dE'dE"5(E”—E) PE Bop te i) 
P F FF 

= fae’ ROE Ee’ h(E’ E) = fanye’ E)@ EE 


ar fant’ se oe P| dE’ ee 


To evaluate the integrals occurring here, we introduce the scattering operator, 
T(z), for arbitrary complex z by means of the equation (Watson 1956), 











1568 CANADIAN JOURNAL OF PHYSICS. VOL. 39, 1961 

(26) T(z) = H,+H,R,(z)T(z), 

which has the solution (Chew and Goldberger 1952), 

(27) T(z) = H,+HR(z)M, 

which is a consequence of the identity (6). From (27) we get 
\(o, Ay 

(28) (6, T@)4) = (6, the)+ faz Mee 


Putting 2 = E+7e, and remembering that 
(29) (o, 7 +6) = = (9g, H,2,¢) = = (9, Hy,) = i y; 
(cf. Gell-Mann and Goldberger 1953), where 7, = 7(E-+7e), we get from (28) 


(30) a? fan Ke lo, Hy hv )l = 4($, Hi¢)—Re 97. 


We also remark that, taking the imaginary part of (28) for z = E++7e, we get 
the optical theorem 

(31) Im T = —2|T/? 

Next we take the off-diagonal element of (27), 


A, ,Hi¢" 
(32) (¢, Tz , = (9, H,o")+ J dE » (o,f Av ss id ze: 
Differentiating this equation with respect to £, putting 2 = E+7ze and 
E” = E, and taking the real part of the resulting expression, we obtain 


. <i ( e\j2 ; 
(33) P| dE! eae = a(¢, Hid)—2 Re[(1+ix7*)(0¢, Hiv,)]. 


To evaluate the remaining two integrals occurring in (25), we consider (26), 
from which we get 


ee ; vie T(z 
(34) (Ws, T(2)¢) = (Ws, Hie) +f dB MW 1o XG Oe 
Putting 2 = £++7e, differentiating the resulting expression with respect to £, 
and using (31) gives 


(35) OP | dE’ ab = —d(~,, MiW.)+Re a7. 


Finally we get from (26) 
(36) (V4, 1) 6") = Wa Hid”) + faz ethene, ee) 


Differentiating with respect to E and putting z = E+7e, LE” = E gives 


(37) p | apr 2G fide) = —d(p,, HiW,)+2 Re[(1+in7*)(¢, May,)]. 


e 
3 
& 
& 


x 
a 
e 
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Using (30), (33), (35), and (37) in (25) yields 
(38) I = ix(T*aT—Tat*), 


which is the desired result. This concludes the derivation of the identity (18) 
and hence of the phase shift formula. 


3. DISCUSSION OF THE BOUNDARY VALUE PROBLEM 


In this section we give an explicit demonstration of the phase shift formula 
(3) for the case where the shape of the scattered waves depends on the energy. 
As discussed in Section 1, this circumstance arises for a gas of excitons in a 
crystal. We shall show how in this case the boundary value problem can be 
treated so as to lead to the correct value for the density of states. For the sake 
of simplicity we assume that the motion in the space of the relative co- 
ordinates of two excitons is described by a continuous variable, r, rather than 
by a discrete variable corresponding to the discrete lattice structure. 

Let ¢, denote the unperturbed plane wave state corresponding to the wave 
vector k: 


(39) Hodx(t) = Exo (r). 


The operator Hy and the eigenvalues Ey in general depend on the center of 
mass motion of the two excitons (cf. Van Kranendonk 1956), but this cir- 
cumstance does not introduce any difficulties. The states diagonalizing the 
scattering operator are given by 


(40) or (fr) = » fy,(k)5(Ex—E) ox (8), 


where \ denotes the same quantum numbers as in (3), and the f,\(kK) are the 
eigenvectors of the reaction operator (cf. Lippmann and Schwinger 1950). By 
carrying out the integration over k in (40) for large r, it can be shown that the 
asymptotic form of the waves (40) is given by 
(41) aatetos piek) sin[k(w, E)r-+é(o, E)], 
where @ = r/r, and k is the length of the vector k = kw reaching from the 
origin to the surface Ey = E. The quantities A, and 6, can be expressed in 
terms of Ey and f\(K), but we need not write down these expressions. In the 
case of spherical symmetry, we have \ = (/,m), 6. = —4$lx, and Ay and & are 
independent of o. 

Consider the surface in r-space defined by the equation 
(42) k(w,E)r+6(o,£) = C, 
where C is a constant. For large C this surface is evidently a surface of constant 
phase of the wave @g,. We again consider only waves of a given type A, and 
we suppress the subscripts A. For a given value, £, of the energy we can choose 
C = nr with na very large integer, and ¢g(f) is then an eigenfunction of Ho 
belonging to the eigenvalue “, which vanishes on the surface defined by the 


equation 
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_ nr—d(o, E) 
8) "= ke, E) 
Let the surface (43) be denoted by S;, and let V be the volume enclosed by Sj, 
71 J ey 
(44) V= 3 ko, E) dw. 


We now increase the energy E and at the same time deform the surface S, 
so as to retain at all E a surface of constant phase, given by (42), and in such 
a way that the same volume V is enclosed. This requires that the increase in 
E be accompanied by a corresponding increase in C. Let E’ be the value of 
the energy for which C has in this way increased from uz to (n+1)z, and let 
S. be the resulting surface, 


(45) , = ntl) r—so, F’) 
i. k(o, E’) 


¢x' (Fr) is an eigenfunction of Ho belonging to the eigenvalue £’, which vanishes 
on the surface S2. Since S; and S2 enclose the same volume V, and since ¢z-(r) 
has one more nodal plane than ¢,(r), it is reasonable to assume that AE = 
E’—E is equal to the separation in the energy spectrum of Ho corresponding 
to the volume V and the given value of \, where E and EF’ are defined by the 
equations (43) and (45) together with the boundary conditions ¢g(5S,) = 
¢z'(S2) = 0. Similar considerations hold for the spectrum of H = Ho+A,. 
We now show that this assumption leads in fact to the correct expression (3) 
for a. 

From the expression (44) for V we see that, in the limit V > © and fora 
given value of the energy, the quantity m is of order V3. We expand nz in 
descending powers of V}, 


(46) nx = Vi +e.+0( v-). 


The coefficients can be obtained by substituting (46) into (44). This leads 
to the following expressions for c; and C2, 


(47) a= (3/A)}, c2 = (D/A) 
where A(E) and D(E£) are defined as 
(48) A =fk(w,E)“*do,  D = fio, E)k(o, E)~*do. 


To obtain an expression for AE, and hence for the density of states, p() = 
(AE), we consider the equation 
"Jnx—d(w, E)| oa "J (n+1)4—5(w, E+ AE) @ 
rhea ) = ieee rr rere - ; 
\ k(@, E) k(@,E+4E) f 
expressing the fact that the surfaces S; and S» defined by (43) and (45) res- 
pectively enclose the same total volume V. We first expand AE in powers 


(49) 


of n=, 
(50) E =an"+bn?+... 
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Substituting (50) in (49), we get 
(51) a = —3(A/A’), b = —3(B/A’), 


where the dashes denote differentiation with respect to E, A is given by (48), 
and B(E) is defined as 


(52) B= Su ~24+| 37 1) ¥ yl a(Z) -! -] ae dw, 


where y = 5/x. Combining the expansions (46) and (50), we get the desired 
expansion of AE in descending powers of V!. Denoting quantities relating to 
the spectrum of Ho by a subscript zero, we get in this way for the density of 
states, po(EZ), corresponding to the total volume VY and the given value of X, 


(53) po(E) = (3V/A)*(ma)-!+ (Do/a A) — (b0/a?) +0(V-4). 


We next evaluate the density of states, p(£), in the spectrum of H. The only 
difference is that in the asymptotic form (41) of the scattered waves, 6 is 
replaced by 6+7, where » = (£) is the scattering phase shift. In the limit 
V — o, we evidently get 

D—Do_b—bo 


(54) Rt hy aes 


where D and b are obtained from Dy and bo by replacing everywhere 6 by 6+7. 
Using (48) and (52), we easily find 


(55) D—D> = An, m(b—bo) = an—a’n’. 
Combining (54) and (55) yields finally 


, 


(56) p(E)—po(E) = =n, 


i 
T 
which is the correct expression for the difference in the density of states caused 
by the interaction H, in the limit V — . The quantity a@ defined by (1) is 
equal to 

(57) a = {dEe~®*[p(E) —po(E)], 


and the result (56) is thus seen to lead directly to the correct expression (17) 


for a. 
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APPENDIX 


We must evaluate the integral 


1 x «(E’— E'-E"” 
A.l) I= 1 ff awan" 9 ae pon 78 
ue ‘ [(@"—B) +e] Ly +e] Ey te" 

Xf(E’, E”), 
where the limit «’ — 0 followed by « — 0 is to be taken. The integration over 
E” runs effectively over an interval of order of magnitude ¢ around E. Putting 
E” = E+xe, we have x = O(1), and the integral J can therefore be written 
in the form 






$I AIE IAT ER SEG PNT antag 


ie ee 
(Aa) 4 oe fan Sef ak’ pa (E as *t-¢? (ett) ne he 


where 2 = E’—E-—ie’. The integration over x can be carried out, giving 
(A.3) I fat EE E- 3 f(E’, E). 
- ot = ‘die ee ieee 
(E’—E)’+é (E’—E)"+ (ete) 
Since f(E,E) = 0, the limit e’ — 0 is finite, and we get 


[= Z az e’— OEE aegis 


Le = Bye ; 

l fe y IEE) Uf yy) JE) 
a ee Pf af a Be ee eer ne 5 
oe En 60 2 | 


The last term is equal to o. giving 


at ce 2) j of (E’, E) 
ae pete a ee Ce Fie ree 
ray Jae cs wee 


which is the result used in the text, cf. Eq.(24). 
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ON THE STATISTICAL PROPERTIES OF SOME 
TRANSPORT EQUATIONS! 


ADRIAN E, SCHEIDEGGER 


ABSTRACT 


It is shown that, whenever a system is a linear combination of a large number 
of fluctuating component systems and in the ‘‘large’’ system a certain quantity 
is a constant of the motion, under equilibrium conditions that quantity is canoni- 
cally distributed in the component systems and, under non-equilibrium conditions 
(assuming linear regression of the fluctuations and microscopic reversibility), the 
quantity in question is subject to a diffusivity equation with a symmetric 
diffusivity tensor. Application of the above to porous media hydrodynamics 
immediately yields the equations for miscible displacement which were laboriously 
obtained earlier from a special statistical model. Similarly, the corresponding 
equations for suspended sediment transport in rivers and for the mixing of a 
pollutant in air are shown to be the result of an application of the above general 
statement. Finally, the theory is also applied to the inertial frequency range in 
the statistical theory of turbulence. 


1. INTRODUCTION 


In many geophysical applications we encounter transport equations in 
which the transported quantity is subject to a diffusivity equation. Such 
transport equations occur, for example, in connection with the turbulent mixing 
of pollutants in air (Sutton 1943), in connection with the transportation of 
suspended sediment in rivers (cf. Scheidegger 1961), and in connection with 
miscible displacement processes in porous media (Scheidegger 1960). 

In all the above cases, the distribution of the quantity in question (pollutant 
concentration, sediment content, miscible fluid saturation) is subject to a 
diffusivity equation which is superimposed upon the main motion of the fluid. 
In the individual cases, one can arrive at the diffusivity equation by making 
microscopic models of the process. However, the outcome of these models is 
the same in every case so that one might ask himself whether there is not a 
common element in the statistical assumptions involved in the individual 
cases. It will be shown that this is indeed the case. 

In all cases in question, there is a quantity for which there exists a conserva- 
tion law. Any such quantity, if fluctuating in a small part of a large system, 
will be subject to a canonical probability distribution in that part of the 
system. Thus, it is possible to define ‘‘thermodynamic’”’ functions where the 
quantity in question plays an analogous role to that of temperature in ordinary 
thermodynamics. 

Under non-equilibrium conditions, Onsager’s theory leads to a diffusivity 
equation for the quantity in question just as it does for the conduction of heat 
with regard to the temperature in ordinary non-equilibrium thermodynamics 

Any quantity for which there is a conservation law (and for which certain 
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statistical assumptions with regard to the fluctuations can be made) is macro- 
scopically subject to a diffusivity equation. It thus turns out that the statistical 
hydrodynamics in porous media as discussed by the writer (Scheidegger 1954) 
is but a special case of a more general statistical transport theory. The same 
theory also leads immediately to the diffusivity equations in the other cases 
that were mentioned above. 


II. TRANSPORT WITH MIXING 


The transport equations with which we shall concern ourselves are 
characteristic of the phenomenon of mixing. 

In all such cases the mean flow of the fluid is described by some kind of an 
equation of motion, superimposed upon which there is a diffusivity equation 
(with a mass transport term) describing the mixing. In other words, the field 
of (mean) flow velocity v(x,t), x denoting the space co-ordinate, ¢ time (bold- 
face denotes vectors), is given by one set of equations, superimposed upon 
which there is an equation describing the concentration y (x,t) of a tracer 
within the flow. 

For our purposes, we shall assume that the equations of motion for the mean 
flow have been integrated so that the velocity field v is known. Furthermore, 
we assume that, as far as the mean flow is concerned, we have steady state 
conditions so that v is a function of the position co-ordinates x only. Then, 
the concentration y of a tracer is described by the following diffusivity equation 


(2:4) SY a * oo Zz a Duzer ~ he USER 


j = = 


where it is assumed that the summation convention applies. The quantity D;,; 
is a tensor which describes the amount of mixing that is taking place; this 
tensor may itself depend on x; as well as on 2. 

The reason that the above diffusivity equation describes the mixing is an 
outcome of the assumption that the mixing is a random process. Then, the 
deviation of a tracer particle from its mean position is completely random 
during any one time step. If what happens during one time step is independent 
of what happened previously (no autocorrelation), then, no matter what the 
distribution of deviation during one single time step is, the Central Limit 
Theorem of probability theory states that after many time steps, one will end 
up with a Gaussian distribution for any one tracer particle. Equation (2.1) is 
an equivalent expression of the resulting Gaussian distribution in differential 
form. 

The above argument has been detailed, for instance, by the writer for 
the dispersion of a tracer in a porous medium (Scheidegger 1960). 


Ill. PHASE SPACE 
The description of the mixing processes given in Section 2 represents a 
rather direct application of statistical mechanics to the ‘‘random walk” of a 
tracer particle within the main stream of a fluid. The writer is not aware that 
ensemble statistics as introduced by Gibbs into the kinetic theory of gases 
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have been applied to mixing processes as considered here, although attempts 
are not lacking. The reason for a difficulty arising in such attempts is that, in 
Gibbs’ theory, the basic step is to introduce an energy function (the Hamil- 
tonian expression) which is a constant of the motion. The processes considered 
by us, however, are highly dissipative so that the energy is no longer a constant 
of the motion; thus, the analogy with the kinetic theory of gases is broken at 
this point and one cannot simply follow Gibbs’ scheme to arrive at expressions 
for the entropy, etc. 

It is obviously necessary to find a constant of the motion which has to take 
the place of the energy in Gibbs’ scheme. Since there is a law of conservation 
of (tracer) mass during the mixing process, it seems obvious to seek such a 
constant in connection with the total amount of tracer that is available. 

Thus, let us consider a volume V that travels with the main stream of the 
fluid. The co-ordinates of individual points of this volume are described by the 
vector x’(x’,y’,z’) where it is assumed that the primes denote co-ordinates 
that are measured from an origin which is travelling with the main stream of 
the fluid. Furthermore, we assume that this volume is tsolated from the rest 
of the fluid, i.e. that no mass exchange takes place across its boundaries. 

One may ask himself whether such a volume can actually occur in nature. 
It is easy to see that it is approximated in sediment transport in a river with 
uniform flow (approximately horizontal), since the transverse mixing of the 
sediment particles is the only mixing that is of significance (the longitudinal 
mixing may be neglected). Any two vertical planes in the river and travelling 
with the river will then bound a volume of the required type. In porous media 
physics a corresponding case can also be envisaged if it be assumed that the 
longitudinal dispersion is zero. 

Once a volume V of the required type has been defined, it is clear, from the 
principle of conservation of mass, that the total amount of tracer contained 
in it must remain constant: 


(3.1) fv W(x',y',2’)dx'dy'ds’ = H = const. 


where ¥ denotes tracer distribution as in the last section. The equilibrium 
conditions are given by the asymptotic solution of the tracer equation (2.1); 
let us denote this asymptotic solution by y,,. 

Equation (3.1) bears out a certain analogy with similar equations in gas 
dynamics. In order to strengthen this analogy, we now split our volume V 
into N cells of volume V;. The cells are indexed by a set of numbers of x’,y’,2’ 
which we shall denote by 2. 

Thus, the 7th cell will contain the amount y; of tracer substance 


(3.2) vi = fv, (ry 2’ )dx'dy'ds’. 
Equation (3.1) becomes then 

N 
(3.3) H= > ¥.. 


In analogy with the Hamiltonian expression for energy, we define new variables 


p; as follows 
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(3.4) bi = +vXi 
where the p;’s are always defined as positive quantities. Then we have ‘i 
N a 
(2.5 2 
(3.5) H=)> pi 
i=1 


where H is now analogous to the energy in gas dynamics. We define variables 
gi, which are conjugate to the p,’s, by the following well-known conditions 


(3.6) di = 0H/dp, = 2p, 
(3.7) bi = 0H/dq; = 0. 

It turns out that the p,’s are constants of the motion 
(3.8) Pi = %. 

Hence the g,’s are linear functions of time 

(3.9) qi = 2a; + const. 


The p;’s and q;'s define a phase space; any system under consideration is 
represented by one point in this phase space. Because the variables p,, g; have 
been defined so as to satisfy the Hamiltonian equations of motion, Liouville’s 
theorem is valid for this phase space. 

In addition to the phase space describing the tracer distribution over the 
whole volume I, it is often convenient to consider subspaces which refer to 
one value of the index 7 only. 

The above equations describe completely static conditions, i.e. the tracer 
amounts in the individual cells are constants of the motion. Since eq. (2.1) 
indicates that these tracer amounts may change, we have to introduce weak 
interactions between the various cells. We assume that this weak interaction 
can be described by adding an interaction function U into the Hamiltonian 
expression (3.5) for the total amount of tracer: 


N 
(3.10) H= D> pi+U(q... qn). 


i=] 
Then, eq. (3.6) still remains, but (3.7) is changed to 


dH As ou 


(3.7a) bh, = - = — : 
om) Pi oq: Ogi 


We shall generally leave the function U unspecified and simply assume that 


it represents a weak interaction. 


IV. EQUILIBRIUM STATISTICS 
Now that we have defined a phase space, it is possible to define ensembles. 
These ensembles will be attained under equilibrium conditions because of the 


presence of the interaction terms. 


PPTEUP NS 
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Following Gibbs’ scheme, we assume a microcanonical ensemble for the 
density function P in the phase space for the complete system consisting of 
the N cells 


(4.1) P = const. for H(p;, qi) between E, E + dE, 
P = zero otherwise. 


As long as the Hamiltonian is essentially given by (3.5), but with a very weak 
interaction being present, then it is well known that the component systems 
are represented by a canonical ensemble 


oi =. 

(4.2) P= Zz é a0 
where Z is the partition function for the ensemble 
(4.3) Z= f e840. d bdq. 

For a canonical ensemble, the thermodynamic functions are defined as 
follows: 
(4.4) F= — : InZ 

; 8 , 
4.5) T= m3 
( ov eit kB , 
(4.6) S=- a =—k | P \n Pdpdq. 


In gas theory, F is usually called the free energy, 7 the temperature, and 
S the entropy. Furthermore k is some constant. 
For our canonical ensemble, it is easy to evaluate the partition function. 


We have 
(4.7) H = p’. 


Hence, assuming that the integration over q vields a constant C, we have 


(4.8) Cz= f da, 

2 é poi e |x 
4.¢ Z=C id = | Fe r= — A/ 
(4.9) Z=( J dp V8 do é- as > | 3 


a“ 


(note that the domain of p is 0 < p < —). 


At equilibrium, the tracer concentration in each cell must be y,,. This must 
be equal to the average concentration in the cell. The latter is 
1 Vr 
ie. 
8 4 


C \ e 5d p = | 3 


(4.10) Vv. a p ae 
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It is seen, thus, that the ‘‘temperature’’ is related to the mean concentration 
in the cell. If we set the constant 





(4.11) k = 2, 

then 7 is directly related to the tracer concentration in question 
(4.12) Pm zs = Yo 

Thus 

(4.13) P= —2y,(K + 3 In y,,) 

with 

(4.14) K = In CvV1/V2 

and 

(4.15) S = — dF/ay,, = Iny, + 2K +1. 


The above relations describe the physical phenomena in a very satisfactory 
manner indeed. 

The average concentration in a cell over time is connected with the modulus 
of the canonical ensemble; it is directly analogous to the temperature in a gas. 
The entropy can be calculated. 

The tracer concentration in many cells is also subject to a canonical distribu- 
tion, provided all these cells are physically identical. This is true in mixing 
processes in a porous medium if gravity is neglected, or in a river for cells at a 
constant level above the bottom. 


V. NON-EQUILIBRIUM STATISTICS 

The statistical properties of the transport equations discussed thus far are 
concerned with equilibrium conditions and lead to rather obvious results. 
Of great importance is an analysis of the conditions that obtain when the 
system under consideration is not in equilibrium. 

In this instance, the present analysis is exactly analogous to the conduc- 
tion of heat in solids, inasmuch as the tracer density is exactly analogous to 
the temperature in the latter case. 

It is therefore possible to simply rely on the theorems of non-equilibrium 
thermodynamics as they were developed by Onsager and others (see De Groot 
1951). Accordingly, if a quantity is microcanonically distributed over a large 
system which consists of many fluctuating components, then, in equilibrium, 
the component systems will locally be canonically distributed. If there is no 
equilibrium, then the various component systems may still be regarded as 
locally canonical although the parameters characterizing these canonical 
distributions will not have equilibrium values. A theory of “irreversible 
thermodynamics” can be constructed predicting how these parameters will 
attain their equilibrium values. 
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To be somewhat more specific, let us consider the following simple case. 
We take two boxes of equal volume which are in contact with each other but 
which are otherwise isolated. The first box has a tracer density y’, the second 
y’’. Then, the flow of tracer during a time element dt into y’ is dy’ and the 
flow of tracer into y’’ is dy’’ so that 


(5.1) dy’ = — dy”. 

The increase of entropy dS per unit time dt is thus (cf. 4.15) 
: dS_ 1[dy’' ae dy’ ( 1 ) 
5.2 — = =) -| = ——l[-, — — 
(5.2) a = E + y" at y y" : 

Now, we set 

(5.3) dS/dt = JX, 

with 

(5.4) J = dy’ /dt, 

(55) xa te aE, 


if y’ and y”’ are not too different. At this stage it is common to introduce the 
assumption of linear fluctuation regression which leads to 
(5.6) J = LX. 

It is well known that, if there are several boxes so that heat flow can take 
place in several directions, L has to be replaced by a symmetric tensor Ly, 
(Onsager’s relations). 

The above analysis leads to a heat conductivity equation for the quantity y 
if the latter depends on the position in space. This generalization has been 
carried out, e.g. by De Groot (1951, p. 41 ff.) for the conduction of heat in 
solids, and it is therefore not necessary to repeat the details here. The condition 
(5.6) becomes 


(5.7) J; = Zz LiyX j. 
J 
In terms of more commonly used quantities, this can be written 
S oy 
(5.8) “u;,=— > Dy a 
j Xj 


where u; is the flux of the quantity y. However, y, as a constant of the motion 
for the large system, is subject to a continuity equation 


- oy Ou; 
5.9 P=-> —, 
(5.9) ot 7 Ox; 
and thus we obtain 
7 oy 0 oy 
0. 10 = - ) — 
(9 ) at » x ax, Z ij Ox 


which is a diffusivity equation. 
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The Onsager relations for D;; yield (cf. De Groot 1951, p. 41 ff.) 


(5.11) 2. OD ij _ =, aD ij 

wie, Ox, 7 Ox; 

It is obvious that in the equation (5.10), an antisymmetric part of D;,; (because 
of 5.11) cannot be observed and hence D;; may be postulated to be symmetric. 

Provided that the statistical assumptions which were made above are 
retained (linear regression, etc.), it is quite obvious that the above deduction 
of the diffusivity equation is quite general and independent of any of the 
special assumptions about a particular transport process. One can therefore 
state the following general theorem: 

Whenever a system is a linear combination of a large number of fluctuating 
component systems, and in the ‘‘large’’ system a certain quantity is a constant 
of the motion, then under equilibrium conditions, that quantity 1s canonically 
distributed in the component systems and, under non-equilibrium conditions 
assuming linear regression of the fluctuations and microscopic reversibility, the 
quantity in question 1s subject to a diffusivity equation with a symmetric diffusivity 
tensor. 

It is obvious that an application of the above theorem immediately explains 
the occurrence of the diffusivity equation in porous media hydrodynamics (in 
miscible displacement) for which it was laboriously obtained earlier (Schei- 
degger 1954) from a special statistical model. Similarly, the corresponding 
equations for suspended sediment transport in rivers and for the dispersion of a 
pollutant in air are also seen to be the immediate result of the above general 
theorem. 


VI. APPLICATION TO THE STATISTICAL THEORY OF TURBULENCE 

The statements made in the previous sections are quite general; the applica- 
tion thereof to the various transport equations is fairly straightforward. 

In conclusion, we wish to consider a possible application to an additional 
case which might hold some interest: this concerns the statistical theory of 
turbulence. 

In homogeneous, isotropic turbulence, the energy E can be considered as a 
constant of the motion if the viscous dissipation is neglected. The energy is a 
special case of the general correlation tensor R,; (x) which is defined as follows 


(6.1 ) R(x) = u,(a)u,(a + Xx) 
where u; refers to the local velocity and the average is taken over all values of a. 
The energy is 
3 
(6.2) E=3 >  R,,(0). 
i=1 
It is customary to introduce the Fourier transform ;,; of R;;, which is 


(6.3) by = as | Rix 


Or 


—ikx 


3 
ax: 





ree ae 
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The inverse of this formula is 
(6.4) Ri = [Oy(k)e™d?k. 
Thus, the energy can be expressed as follows 


(6.5) E=}3)0 Ri(0) = fa Dd Si(k)d"k = fzdoa’e 


with 


(6.6) E(k) = 4 2) %:(k). 

It is often convenient to assume that ®(k) depends on || only, thus 
(6.7) E = fE(k)d*k = fE(\k|)4ak’dk = [&(k)dk 

where 

(6.8) &(k) = E(\k|)4rk?. 


It is now clear that the total energy of the system is the sum of the energies 
in ‘‘cells” in k-space. These individual energies represent fluctuating quantities. 
Thus, if there was no energy dissipation, it would presumably be possible to 
regard the large system as microcanonically distributed and the individual 
“cells” as canonically distributed. Under non-equilibrium conditions, one 
would then expect that E is subject to a diffusivity equation in k-space, viz. 

: OE 0 OE 
(6.9) — D> 2» Ok, om aE, ; 

It will be noted that the above is true only if E is a constant of the motion. 
It is well known that, in turbulence, an ‘inertial subrange’”’ of k-space can be 
defined in which the above assumption holds. In other words, energy is supplied 
at low values of k, and it ‘‘flows’’ without dissipation to high values of k where 
it is dissipated (cf. Batchelor 1953, p. 121 ff.). The process is thus entirely 
analogous to the flow of heat where on one side (low k) heat energy (turbulent 
energy) is supplied which flows to a sink (dissipation) at the other side (high &). 

For a steady state, Batchelor gives the following dependence of the energy 
on k (Batchelor 1953, p. 122) 

(6.10) &(k) = const. k-5/8, 


This is compatible with our diffusivity equation (note that dE/dt = 0). 
If Dy, is assumed as isotropic, we have in polar co-ordinates 


= div Deorad FE = 2 2- pp GEM 
(6.11) 0 = div D grad oso ape P ab 
This yields 
(6.12) kD ane = const., 


which is compatible with (6.8) if we set 


(6.13) D = const. k8/8, 


So LA ASI a tec 


iA) Pabeod® 
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The behavior of the inertial range in the statistical theory of turbulence, 
thus, can also be explained in terms of fluctuation theory. In non-steady state 
flow, the complete diffusivity equation has to be used: 


(6 14 Oe 0 pe a(& /k° ) 
D, eens a OY Nr a ca 
) at Ok ak 
with @ being some constant. 
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AN ABSOLUTE THERMAL RADIATION CALORIMETER! 
J. H. McGuire 


ABSTRACT 


To allow the absolute calibration of field and laboratory radiometers in the 
range 0.1 to 1.5 cal/cm? sec, a continuous flow radiation calorimeter has been 
constructed, based on a design originally developed by the British Joint Fire 
Research Organization. The temperature differ nce established between a water 
outlet and an inlet is sensed by a thermistor Lridge and the radiation intensity 
measurement is made in terms of the level of the electrical power, applied to a 
built-in immersion heater, which is necessary to reproduce the same temperature 
difference. 


Radiative heat transfer is an important mechanism of fire spread, particu- 
larly from one building to another, and although much information is 
already available on this question, new problems continually arise. To permit 
further studies by the Fire Section of the Division of Building Research, 
National Research Council, it has become essential to be able to measure 
appropriate levels of radiation intensity. 

The lower limit of the range involved is associated with the threshold 
intensities at which common materials will ignite. Results already obtained 
by the British Joint Fire Research Organization show that pilot ignition (i.e. 
ignition in the presence of a small flame one inch or so from the irradiated 
surface) will occur at 0.15 cal/cm? sec for fiberboard and at 0.3 cal/cm? sec 
for wood and many other cellulosic materials. The lower limit of the desired 
range is thus of the order 0.1 cal/cm? sec. 

The upper limit is determined by the feasibility of protecting materials so 
as to increase the threshold levels of intensity at which ignition will occur. 
British work on the effect of fire retardant paint on fiberboard has shown that 
the threshold intensity for pilot ignition can be raised to 1.0 cal/cm* sec. If the 
effect of the treatment on timber were to increase the threshold intensity for 
ignition by the same factor, which is unlikely, it might be thought that investi- 
gations with intensities of the order of 2 cal/cm? sec would be essential. 
Experimental measurements carried out during building fires, however, show 
that such an investigation is not immediately called for. Although the radiation 
levels at window openings in building fires can rise to as much as 3.6 cal/cm? sec, 
levels rarely exceed 1.5 cal/cm? sec at ranges beyond which flames will not be 
expected to extend. Flame impingement is likely to give greater levels of heat 
transfer than a radiation level of between 1.5 and 3.6 cal/cm®* sec, and as far 
as the protection of materials is concerned the effect of flame impingement is 
thus the more interesting subject for investigation. The upper limit of radiation 
intensity required to be covered by the Fire Section’s instrument was therefore 
set at 1.5 cal/cm? sec. 


'Manuscript received July 29, 1961. 
Contribution from the Division of Building Research, National Research Council, Ottawa, 
Canada. 
Issued as N.R.C. No, 6534. 
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Suitable field and laboratory radiometers have been constructed for the 
use of the Fire Section but no calibration facilities have been found con- 
veniently available. It was therefore decided to construct an absolute radio- 
meter for the calibration of radiometers in the range 0.1 to 1.5 cal/cm? sec. 


CHOICE OF INSTRUMENT 

When the British J.F.R.O. began its work on the ignition of materials by 
radiation in the early 1950's, it was faced with the same problem of lack of 
calibration facilities and had to develop its own design of absolute radiometer 
(Lawson and McGuire 1953). The instrument is of the continuous flow calori- 
metric type, and measurement is made of the electrical power required to 
produce the same equilibrium temperature conditions as are produced by the 
radiation level in question. The instrument has proved reliable and sufficiently 
accurate to meet the needs of fire research. 

Before adopting this design a literature survey was carried out to determine 
whether any other type could be as readily constructed. It soon became 
apparent that little or no interest has been shown in the past in the measure- 
ment of infrared radiation levels in the range 0.1 to 1.5 cal/cm? sec. 

The instrument which comes nearest to meeting the requirement is that 
used by Coblentz (1915-1916) at the National Bureau of Standards during 
the First World War. It consists basically of a blackened electrical conducting 
strip, behind which is mounted a radiation thermopile responding to thermal 
radiation from the back of the strip. The first step in measurement is to 
record the output of the thermopile when the blackened strip is exposed to a 
level of radiation. Radiation is then cut off and electrical heating of the strip 
substituted. When the level of the electrical heating has been adjusted to 
produce the same output from the radiation thermopile, the electrical power 
input is taken as a measure of the radiant flux which was previously incident 
on the strip. Coblentz was undoubtedly using a radiation source with a 
temperature of the same order as that of the radiant panel used for the experi- 
mental fire work carried out at N.R.C. The nature of his instrument, however, 
indicates that the level of the intensity which he measured was very much 
lower than the intensity near his source. Although it was the object of his 
work to determine the intensity at the source, he derived this by the inclusion 
in his expressions of a very substantial geometrical factor. 

The accuracy of the instrument, as used by Coblentz, involved very careful 
assessment of the relative areas involved in the heating, both by radiation 
and by electric power. If the instrument were redesigned to allow direct 
measurement of intensities within the range 0.1 to 1.5 cal/cm? sec, the strip 
and thermopile would become very sensitive to draughts. Work on the develop- 
ment of field and laboratory radiometers at N.R.C. has already shown the 
magnitude of this problem. After comparison of these problems with those 
associated with the J.F.R.O. design it was decided not to proceed with the 
Coblentz approach. 

In 1937, Guild described a temperature drift radiometer used for a lower 
range of intensities. It is thought, however, that the temperature drift 
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technique can only be considered superior to a continuous flow calorimetric 
approach when the range of intensities is so low that the temperature differences 
associated with the latter technique are too small to allow accurate measure- 
ment. No simple alternative approaches being available, it was decided to 
construct a constant flow calorimetric type radiometer. 


DESCRIPTION OF APPARATUS 

The apparatus is of the continuous flow calorimetric type, using water as 
the coolant fluid; radiation incident upon the calorimeter produces a tempera- 
ture difference between the water inlet and outlet. Measurement is made in 
terms of the electric power which must be supplied to an immersion heater 
to reproduce this same temperature difference. This technique substantially 
eliminates errors associated with the various indeterminate heat losses. 

The essential component of the apparatus is illustrated in Figs. 1 and 2. 
It consists of a calorimeter surrounded by a water-cooled guard ring which 
ensures that the incident radiation, the level of which is to be measured, falls 
only on the front face of the calorimeter. Factors determining the dimensions 
of the calorimeter are discussed in Appendix A; they are primarily associated 
with the source of radiation used, a gas-fired radiant panel 1 ft sq. The dimen- 
sions of the guard ring are not critical but must be adequate to ensure that 
various components and pipework behind the calorimeter are shielded from 
radiation. 

A constant rate of flow of water is established in the calorimeter by the self- 





Fic. 1. Radiation calorimeter, radiant panel, and shields. 
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Fic. 2. Radiation calorimeter. 


stabilizing arrangement illustrated in Fig. 2. Variations in the hydraulic 
resistance of the calorimeter and its associated feed system can only give rise 
to transient variations in the flow and cannot influence the equilibrium rate 
of flow. The vent illustrated in Fig. 2 was included to eliminate air locks in 
the system. 

The temperature difference which is established between the water inlet 
and outlet is sensed by the two thermistors illustrated in Fig. 2. The function 
of the length of tubing between the calorimeter and the outlet thermistor is 
to ensure adequate mixing of the heated water. Previously it had been found 
that when the thermistor was located very near the calorimeter it gave 
fluctuating readings. The design of the thermistor bridge, together with the 
choice of rate of flow of water, which is a related decision, is discussed in 
Appendices B and C. The circuit is given in Fig. 3. 

Considerable care was taken to ensure that the differential sensitivity of 
the bridge was not substantially influenced by variations in the ambient 
temperature of the inlet water. This was achieved partly by the design of the 
bridge, as discussed in Appendix C, and partly by shielding the water supply 
piping as is illustrated in Fig. 1. 

The electrical immersion heater included in the calorimeter consists of 
No. 26 B and S Nichrome A wire wound on a plexiglas former. Its resistance 
is approximately 7 ohms. Electrical power is supplied by a lead-acid battery. 
A choice of several battery potentials between 6 and 36 volts is available and 
fine control of current is given by two rheostats. Current and potential 


2S PREIS te A 


20 HE NEY RRC ND 


epee 


e 
i 
c 
Be 
' 


eer 


oer 


$0 POORER IS 


ALESSI RI IT” 





McGUIRE: RADIATION CALORIMETER 1587 


390 
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Fic. 3. Circuit diagram of thermistor bridge. 


differences are measured with precision electrical meters and the connections 
between the voltmeter and the immersion heater are separated from the power 
leads to ensure that the potential difference measurement relates to the 
immersion heater proper and does not include the potential drop in the power 
supply leads. No correction is called for as regards the power consumption of 
the voltmeter as it only amounts to a small fraction of 1 per cent of the 
measured power consumption. 

The electrical instrumentation associated with both the immersion heater 
and the thermistors is housed in a single unit. 


PERFORMANCE OF THE CALORIMETER 
The apparatus has been used to calibrate several of the Fire Section’s 
radiometers in the range 0.1 to 1.2 cal/cm* sec and a typical calibration is 
shown in Fig. 4. The response time of the thermistor bridge has been found 
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lic. 4. Typical calibration of a radiometer. 
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to be about 30 seconds, and a measurement including the substitution of 
electrical for radiant heating can be completed in 3 minutes. 

Contrary to expectations no problems resulting from air coming out of 
solution from the water were encountered during the course of a measurement. 
This probably resulted partly from the use of a wetting agent in the water to 
reduce the likelihood of the formation of large bubbles and partly from the 
fact that the time constant of the meter incorporated in the thermistor bridge 
circuit was about 10 seconds so that any short-term fluctuations were inte- 
grated. 

The variability of measurements with the calorimeter is small and has in 
fact proved difficult to measure. No assessment can be made from the cali- 
bration given in Fig. 4 as the variability in this case could be largely attributed 
to the radiometer. Total radiation pyrometer measurements also indicate that 
the level of radiation emanating from the panel is not sufficiently constant 
to allow a reasonably accurate direct determination. The variability is of the 
order 1%, and for the calibration purposes for which the calorimeter is 
required this factor becomes unimportant, first because a calibration is bound 
to involve a number of measurements and secondly because the absolute 
error associated with the instrument is of the same order. 


SOURCES OF ERROR 


The principal, and in fact the only, appreciable source of error lies not with 
the use of the apparatus but in the value assumed for the emissivity factor 
of the receiving surface of the calorimeter. No measurement can be more 
accurate than the accuracy to which this quantity is known and if an incorrect 
value were assumed, the use of the apparatus would give no indication of it 
unless the assumed value were in error by at least 10%. 

The surface in question has been coated with carbon black, the emissivity 
of which is not greatly dependent on the wavelength of the incident radiation 
and is high so that the greatest possible water temperature differential is 
established. Hottel (Chemical Engineers Handbook 1950; Stoever 1941) has 
reviewed emissivity measurements and gives the value for candle soot as 
0.95+0.01. It agrees substantially with the results of Coblentz’s work (1913a, 
b, 1915-1916) early in the century. Although Hottel’s figure is given to an 
accuracy of 1% it is possible that the practical difficulties of applying the 
coating to the calorimeter surface give values of emissivity that may differ 
from the above by say 2%. 

The only other basic source of error in the apparatus would lie in the 
measurement of the electrical power but since this merely involved the measure- 
ment of d-c. voltage and current it can be considered reliable. 

Sources of error associated with the small heat losses from the inner calori- 
meter to the atmosphere and to the outer calorimeter are reduced to a very 
low order by the measurement technique of substituting electrical power 
equal to the level of radiant energy received. The magnitude of the heat 
losses must correspond closely when the same rates of energy are supplied 
either by radiant heating or by the electrical heater immediately behind the 
front face of the inner calorimeter. The thickness of the front face, which is 
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of copper, is sufficiently small that no appreciable temperature differences are 
ever established across it. Different cooling conditions could arise owing to 
the fact that the outer guard ring is irradiated only when the heating is by 
radiation. The flow of water through this guard ring is sufficiently great, 
however, that the radiant heating does not establish any appreciable tempera- 
ture difference between the water outlet and inlet. 

It is nevertheless desirable that the extraneous heat losses from the inner 
calorimeter should not be of the same order of magnitude as the heat transfer 
by mass flow of the coolant water and this question was investigated experi- 
mentally by injecting a known level of electrical power and comparing it with 
the heat dissipation via the water. The temperature difference between the 
water outlet and inlet was made with differential thermocouples rather than 
with the thermistor bridge, which has never been calibrated. To reduce the 
effect of stray thermal e.m.f.’s, which can be significant when using thermo- 
couples to measure small temperature differences, and to maximize the 
extraneous heat losses being investigated, a high water temperature difference 
was established by using a high power input and a low water flow. With a 
temperature difference of about 20°C, at least five times higher than the 
usual maximum, two measurements gave the ratio of the heat flow via the 
water to the heat input as 95.4 and 98.3%. The accuracy involved in the 
experimental method is obviously poor but the results justify the conclusion 
that the extraneous heat losses are small. 

As discussed earlier, the sensitivity of a thermistor bridge can be dependent 


on ambient temperature. This could introduce error associated with pre- 
heating of the supply water to the inner calorimeter when it is exposed to 
radiation. To eliminate this source, care has been taken to ensure that the 
inlet water supply is carefully shielded from radiation under both sets of 
heating conditions. In addition, the thermistor bridge has been designed to 
reduce the dependence of the differential sensitivity on the ambient tempera- 
ture of the water. 


CONCLUSION 
The radiation calorimeter described in this report permits the calibration 
of a radiometer in the range 0.1 to 1.5 cal/cm? sec, using a gas-fired panel 
1 ft sq. as a source of radiation. 
The accuracy of the apparatus is of the order of 2% and the variability of 
readings is lower. A better knowledge of the emissivity factor of the receiving 
surface would improve the accuracy. 


ACKNOWLEDGMENT 
Acknowledgment is due to P. Huot of the Fire Section for the construction 
of the apparatus described in this report and for helpful advice on the elimi- 
nation of various unforeseen technical problems. 


REFERENCES 


CHEMICAL ENGINEERS HANDBOOK. 1950. (McGraw-Hill), p. 485. 
CoBLENTz, W. W. 1913a. Natl. Bur. Standards Bull. 9, 283. 
19136. Natl. Bur. Standards Bull. 9, 7. 











1590 


CoBLENTz, W. W. and EMerson, W. B. 1915-1916. Nat!. Bur. Standards Bull. 12, 503. 

GuILpD, J. 1937. Proc. Roy. Soc. Ser. A, 161, 1. 

Lawson, D. I. and McGuire, J. H. 1953. A radiation calorimeter for the absolute measure- 
ment of radiation intensities between 0.4 and 12.5 watts/cm?. Joint Fire Research 
Organization (D.S.I.R. and F.O.C.), Fire Research Note No. 37/1953, March 1953. 

McGuire, J. H. 1953. Heat transfer by radiation. Dept. of Scientific and Industrial 
en (DSIR) and Fire Offices’ Committee, Fire Research Special Report No. 2, 





























CANADIAN JOURNAL OF PHYSICS. VOL. 39, 1961 


M.S.O. London. 
SToEVER, H. J. 1941. Applied heat transmission (McGraw-Hill), Table IV, p. 214. 


APPENDIX A 
THE PERMISSIBLE SIZE OF THE RECEIVING SURFACE 


The radiation calorimeter is used to calibrate radiometers which have a 
receiving area only } in. in diameter. The radiation calorimeter measurements 
must therefore refer to such a small area. 

For convenience in the construction of the calorimeter a larger receiving 
area is desirable and the variation of the intensity over a larger area must 
therefore be investigated. The apparatus will be used in conjunction with a 
radiant panel 1 ft sq. Configuration factors relating to rectangles are, how- 
ever, cumbersome and both radiating and receiving areas will therefore be 
assumed to be plane disks. 

Using the nomenclature 

R = radius of radiating disk, 
mR = radius of receiving disk, and 
nR = separation between the two disks, the two disks being normal to a 
common axis, 
the integrated configuration factor F relating the two disks is (McGuire 1953): 


B= tnt pm? $1 — [(1 — m?)?42n"(1+m®) 40h 
Z 


Assuming that m <1, the expression approximates to 


, R'm? j m at \ 
~ wl UU - ai + wap 
The mean configuration factor at the receiving disk, relating to the mean 
intensity, will thus be 
if m* _\ 


an ere Sates ye 


The configuration factor on the axis will be 
fe = 1/(? +1); 


fe will relate to the field radiometer to be calibrated for which the value of m, | 
when used in conjunction with the radiant panel 1 ft sq., will be very small. | 

The fractional error in the intensity measurement due to the finite area of 
the calorimeter will thus be 







—_ n?m?/(n?+ 1 )?. 
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Since this error is a function of m which must essentially remain a variable, 
the application of an error correction would be inconvenient. It would be 
more convenient to ensure that the error is always small. 

Differentiation of the above expression shows that the error will be greatest 
when » = 1 and consideration of the temperature of the panel and the radiation 
level required shows that this position will be used for calibration purposes. 
This condition has therefore been assumed in calculating the error which is 
shown in Fig. 5. 

To ensure that the error is appreciably less than 1% it is thus desirable 
that the dimension of the receiving surface should not exceed 2 inches. 


MAX FRACTIONAL ERROR (n=!) PER CENT 





° ! = 3 4 5 
DIAMETER OF RECEIVER, INCHES 


Fic. 5. Measurement error introduced by finite area of receiving surface. 


APPENDIX B 
DESIGN OF THERMISTOR BRIDGE 


The design of the thermistor bridge is dependent on the range of temperature 
difference to be established between the water inlet and outlet of the calori- 
meter. It was decided that this should lie within the limits 1° C and 10°C, 
the lower limit being set by considerations of the accuracy of differential 
temperature measurement and the upper limit by the desirability of eliminating 
possible second-order effects of stray cooling. First-order effects of stray cooling 
are already eliminated by the adoption of the electrical power substitution 
method of measuring the radiant intensity. It was thought that water flows 
and temperature differences could be maintained within the limits given in 
Table I. 
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TABLE I 
Water flow and temperature difference ranges 


Radiation intensity Water flow Temperature difference 
2, 





Limit (cal/cm? sec) (cc/sec) CC) 
Upper 1.5 10 4.0 
5 lee 


Lower 0.1 1.5 








The most appropriate measuring circuit in which the two thermistors can 
be incorporated is the simple Wheatstone’s bridge as illustrated in Fig. 6. 


E 


mt R(1+d) 
(THERMISTOR) (THERMISTOR) 


Fic. 6. Wheatstone’s bridge. 











To simplify the expression for sensitivity it has been assumed that the 
thermistors are perfectly matched and that at ambient temperature the four 
arms are of equal resistance. Some improvement in sensitivity at the expense 
of power consumption could be achieved by the substitution of low resistances 
in the two upper arms of the bridge. 

The sensitivity of the bridge of Fig. 6 is given by 















6E 


’« ~ R(4+38) +27, (2+5) 
where the significance of the symbols is indicated in Fig. 6. 

The power dissipation in the thermistors must be limited to some value 
W such that the temperature will not be raised excessively by ohmic heating. 
This dissipation is given to a close approximation by W = E?/4R. Substituting 
for E, in the expression for ig, gives 








In general, therefore, if 7, is not of a higher order than R, the bridge will 
be more sensitive for lower values of R. 
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A convenient thermistor for this application is the Veco 32A11 thermistor 
with a resistance of approximately 2 kQ at 25° C and a dissipation constant 
of about 5 mw/°C temperature rise under the flow conditions considered here. 
If a temperature rise of 0.4° C is acceptable a dissipation of 2 mw can be 
tolerated. 

Using a 50-va meter with a resistance of about 1.8 kQ, the current values to 
be expected for the conditions discussed above and the two extremes of 
radiation intensity listed in Table I are 37 wa and 17ya. 

To realize a practical bridge a 500-ohm potentiometer was included in the 
upper arm. of the bridge, as shown in Fig. 3, to allow balancing of the bridge. 
The circuit of Fig. 3 also includes a resistor in series with the battery. The 
function of this resistance is discussed in Appendix C. 


APPENDIX C 
REDUCTION OF THE EFFECT OF AMBIENT TEMPERATURE 


The resistance of a thermistor can be approximately represented by the 
expression 
z = ae’!t, 


z= 


Where one particular type of thermistor is considered, e.g. the Veco 32A11, 
the value of 6 does not vary greatly between thermistors. This being so, it is 
to be expected that variations in ambient temperature will not greatly affect 
the zero reading of a bridge of the type illustrated in Fig. 3. It is, of course, 
important that the set zero potentiometer should be located as illustrated. 

With a view to reducing the dependence of differential sensitivity on 
temperature consider the inclusion of a resistor S in series with the battery 
as shown in Fig. 7. 

The expression for the sensitivity is 
ER(e1—23 


te = (g)-+22) (27,R+R’ + SR+ Sry) +2122(2R+S)+RS(R+2r,) * 
E 





Fic. 7. Circuit with external series resistor. 
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Substituting 2; = ae’/” and 2. = ae’/(7+® and making the approximations 
(2; —22) = (ab0/T*)e"’, 

(21+22) = 2ae’/’, 
and ZZ = are??/7, 
sac se os) alo Th ne 
€  Qae’’* (2rgR+R°+SR+ Srg) tae!" (2R+S)+RS(R+2rQ) ' 
Differentiating with respect to 7 and equating to zero gives the condition 


—2Rz"b+8r,RT2+4R°T2+4RTs° 


S = StS ease esmeeeaeeee te age eee dea apa 


~ bz’ — 272” — 4T2R — 4Tar, — bR® — 2Rbr, — 2R°T — 47,RT 


where z = ae’?!’, 
In the bridge under discussion 


R =z = ry, (at normal operating temperature) 


making these approximations 


Substituting the numerical values R = 2 kQ, b = 3,495° K, and 7 = 300° Kk, 


Inclusion of a resistor S = 372 2 should therefore, at temperatures in the 
region of 27°C, eliminate the dependence of bridge sensitivity on ambient 
temperature. 





So FRR A ET re ON EE OR ERNE: 





Be 


, 
| 
i 
b 
; 
j 
% 
: 
: 


SO RE ERT OR ERITH RS 


ARR NEN DIRE ESRI 


reece ier 


eR PSC 


ner nen oe oe 





HIGHER-ENERGY GAMMA RAYS IN THE DECAY OF 
NEPTUNIUM-239' 


R. D. Connor? AND D. R. MAcKENZzIE® 


ABSTRACT 


In the decay of neptunium-239 only two y-rays with energies in excess of 
335 kev have been detected. These have energies of 440+10 and 490+10 kev 
and intensities of 3.6+0.9X1075 and 2.0+0.5X10~ photons per disintegration 
respectively. A search at high resolution failed to reveal the associated conversion 
lines in the 6-spectrum. 


INTRODUCTION 


The principal energy levels of Pu®*® are now very well established as a result 
of studies of the B-decay of Np”*® (2.34 days), the electron capture decay of 
Am” (12 hours), and the a-decay of Cm* (35 years). There is, in the literature 
on the 6-decay of Np”, a report of two weak y-ray transitions of energy 
440 and 490 kev and of intensity 1.6X10~ and 1.9X10~ photons/ disintegra- 
tion, respectively (Lefevre e¢ al. 1955): The present work sought to confirm 
these transitions and to determine their places on the existing level scheme 
using scintillation counters and the Chalk River r+/2 8-ray spectrometer at 
high resolution. 


EXPERIMENTAL PROCEDURE 

The gamma-ray spectrum of Np**® was determined using a 5 in. X4 in. 
Nal(TI) crystal optically coupled to a Du Mont photomultiplier type 6364. 
The resolution at 660 kev was 10.5%. To reduce the background to a con- 
veniently low level, the counter assembly was heavily shielded by lead except 
for a narrow central channel to the crystal. The spectrum was analyzed after 
amplification by a transistorized 100-channel pulse-height analyzer. The 
source was placed 10 centimeters in front of the crystal and energy calibrations 
were made using the well-known radiations from Cs'7 (660 kev), Na® (511 
kev), and Hg?® (279 kev). The 6-ray work will be discussed later. 


SOURCE PREPARATION 

The carrier-free Np**® used for all sources was prepared by separation from 
samples of U8 (uranium highly depleted in U***) which had been irradiated 
in the NRX reactor for approximately 2 days. The irradiated material was 
dissolved in nitric acid and converted to chloride. The neptunium in the 4- 
valent state was separated out by two successive fluoride-hydroxide cycles 

1Manuscript received August 8, 1961. 
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using lanthanum as carrier. Neptunium was then separated from the lanthanum 
by means of an anion exchange column (Dowex-1). Samples of active solution 
obtained by this procedure were checked for purity by following their decay 
rates in a methane proportional 6-counter for at least 10 half-lives. One sample 
originally 107 c.p.m. was followed for 20 half-lives and found to have a residue 
of 80 c.p.m. of long-lived activity. 

Sources for the 8-spectrometer were prepared by sublimation from a boat- 
shaped tantalum filament at 1800° C through a slot 0.020 in. wide by 0.5 in. long 
onto a backing of 800 ug/cm? aluminum foil. For the y-counting, sources were 
prepared by direct transfer of active solution to standard aluminum counting 
trays using micropipettes. 


RESULTS 
(a) Gamma-Ray Studies 
The y-ray spectrum of Np** was examined using lead absorbers up to 
49.0 g/cm*. The spectrum obtained with a 27.0 g/cm? lead absorber is shown 
in Fig. 1. Here the intense low-energy portion of the spectrum has been biased 
out by the analyzer. Only two peaks, at 440+10 and at 490+10 kev, are 
found between the 335-kev y-ray and the total disintegration energy of Np”*® 


10° 









m 


NUMBER OF COUNTS 


3 


250 





350 





kev 
cl a le lh nn 

20 40 60 80 100 
CHANNEL NUMBER 





Fic. 1. y-Ray spectrum of Np*® filtered through 27.0 g/cm? lead. 
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(724 kev). Their decay was followed for 3.5 days and found to agree with the 
period of Np***. These results are completely in accord with the findings of 
Lefevre. 

The unresolved group of lines of lower energy in Fig. 1 is known to consist 
of four y-rays of energy 335, 316, 286, and 278 kev respectively and their 
relative quantum intensities are 4.4, 3.3, 1.4, and 31.1 respectively (Ewan 
et al. 1957). If these figures are corrected for absorption in lead and for varia- 
tion of the efficiency of the detector with energy, the resulting relative inten- 
sities are those appropriate to the area under the composite group in Fig. 1. 
Hence the areas under the photopeaks of the 278-, 440-, and 490-kev gamma 
rays can be obtained. 

Recorrection for lead absorption and detector efficiency yields the relative 
intensities of emission of the three y-rays in question. The absolute intensity 
of emission of the 278-kev y-ray has been determined as 0.133+0.015 photons 
per disintegration (Connor and Fairweather 1959), while a value of 0.136 with 
an error of about 10% is obtained if the relative intensities of transitions to 
the ground state rotational band of Pu®*® (Ewan et al. 1959) are coupled with 
the observed intensity of 10.5% of the 8-feeds to this rotational band (Connor 
and Fairweather 1959). 

Using the latter figure the absolute intensities of the 440- and 490-kev 
radiations appear as 3.6+0.9 X 10~ and 2.0+0.5 X 10~ photons per disintegra- 
tion. The rather large errors quoted reflect the sensitivity of the final result 
to the values of the absorption coefficient used. The absorption coefficients 


for this geometry have been examined experimentally for y-rays of energy 
279 kev (Hg?) and 511 kev and have been found to agree closely with the 
theoretical values without coherent scattering (Grodstein 1957). Consequently, 
the values employed in these calculations were obtained by graphical inter- 
polation of these theoretical values. 

The relative efficiencies of photopeak detection by the Nal crystal were 
determined after the manner suggested by Mott and Sutton (1958). 


(b) Beta-Ray Spectrometer Studies 

In an attempt to obtain the energies of the 440- and 490-kev y-rays with 
greater precision, the appropriate regions of the 8-spectrum were searched for 
conversion lines using the 71/2 spectrometer at Chalk River at a resolution 
of 0.05%. No conversion lines corresponding to either of these transitions 
could be detected. The Z III line of the 335-kev y-ray was clearly identified 
although it is weak. The absolute intensity of this E1 y-ray can be determined, 
from the earlier works mentioned, as 0.0192 photons per disintegration. The 
L III internal conversion coefficient (Sliv and Band 1958) is 5.710-, thus 
the number of LZ III conversion electrons is 1.09 X10-> per disintegration. 

Had the K-conversion lines of the 440- and 490-kev y-rays been as intense, 
it is probable that they would have been observed. It is concluded that the 
missing conversion lines have intensities <10-° electrons per disintegration 
of Np**. Table I shows the expected number of conversion electrons per 
disintegration for the transitions El, E2, £3, M1, M2, and M3 using the 
K-conversion coefficients of Sliv and Band (1958). 
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TABLE I 


Predicted A-conversion electron intensities (per disintegration) 








y-Ray Assumed multipolarity 
caegy Quantan ———— — ~ - 
(kev) intensity El E2 E3 M1 M2 M3 





440 3.6X10-° 5.41077 1.4X10-* 3.6X10-* 1.2X10- 3X10 6.1X10-5 
490 2.0X10-§ 2.41077 6.3X10-7 1.6X10-* 5.0X10-§ 1.2X10-5 2.4107 


We see that the gamma rays in question could be El, E2, E3, or M1. 


DISCUSSION 

Whereas the energies of the two y-rays are in accord with the determination 
of Lefevre et al. (1955), it will be seen that the intensities here reported are 
much less. The discrepancy would appear to arise from the fact that approxi- 
mately equal photopeak heights were obtained for these y-rays with an 
8.12 g/cm? lead absorber (Lefevre) while in the present work 27.0 g/cm? 
was needed. When a lead absorber of thickness approximately 8 g/cm? was 
used in the present experimental arrangements the 440- and 490-kev y-rays 
were too weak to be examined satisfactorily. We have not been able to find a 
means of reconciling the two sets of measurements. 

Energy considerations would lead to the expectation that the transitions in 
question terminate on the ground state rotational band (K = 1/2) (Fig. 2). 
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The initial level, assuming both originate from a common state, may be either 
the 511.9-kev or a hitherto unreported level at about 500 kev. In either case 
the photopeaks may each be double, the 440-kev peak representing transitions 
to the 75.71- and 57.27-kev levels, the 490 peak representing transitions to 
the 7.85-kev level and the ground state. This last transition may be relatively 
weak. If these transitions occur as indicated, it may well explain the greater 
intensity of the 440-kev y-ray, but neither peak appears unduly broad when 
compared with neighboring calibration lines. 

The Nilsson diagram (Mottelson and Nilsson 1959) shows that intrinsic 
states with spins 7/2 and 5/2 are to be expected for levels in the region of 
500 kev, and it has not been possible so far to indicate unambiguously which 
is appropriate to the 511.9-kev level. It is therefore possible that another level 
exists in this region having a spin of 5/2 or 7/2. Taking the predominant tran- 
sitions as those to the 57.27- and 7.85-kev levels, the energies found in both 
investigations would indicate a level just below 500 kev, i.e., at about 497 kev. 
On the other hand, if the initial level is that at 511.9 kev, there must be a 
systematic error in the determination of the photopeak energies, both here 
and in the earlier work of Lefevre. 
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STUDIES IN CHROMATOGRAPHIC TRANSPORT 
I. A SIMPLIFIED THEORY! 


A. P. TuDGE 


ABSTRACT 
A simplified but accurate treatment of the theory of adsorption chromatog- 
raphy is presented. The resulting transport equations (which apply to single 
and multiple adsorbates) are easy to comprehend and apply. Several useful 
methods are discussed for determining quantitatively the transport of con- 
centration shocks. 
INTRODUCTION 

Many authors have published papers on the theory of adsorption chromatog- 
raphy. The early contributions were made by Wilson (1940), DeVault (1943), 
Weiss (1943), Offord and Weiss (1949), Glueckauf (1946, 1947, 1949), and 
Sillén (1950). In the last few years important contributions have been made 
by Baylé and Klinkenberg (1954, 1957a, 19576), Smit and van den Hoek 
(1957a, 1957b), and Klamer and van Krevelen (1958). This previous literature 
has presented, in principle, the solution to many of the important problems 
in chromatographic theory as it applies to adsorption processes. However, in 
all of these publications the equations for chromatographic transport were 
quite complex. This was caused mainly by the presence, in these equations, 
of many variables with dimensions of length, mass, and time. In addition, 
different authors have used different dimensions for the same variable. 

In the present publication a considerable simplification is achieved by the 
use of dimensionless variables. In addition, several useful methods are given 
for the quantitative determination of the velocity and the position of a con- 
centration discontinuity, i.e., a concentration shock. Thus, it is our purpose 
to present a new treatment of the theory of adsorption chromatography which 
is much easier to comprehend and which can be applied more readily to 
practical problems. 













ASSUMPTIONS 







The assumptions inherent in our treatment are common to all ‘‘ideal 
equilibrium theories”. These are: 

(1) the adsorbate in ‘“‘solution” is always in equilibrium with the adsorbed 
adsorbate; 

(2) in the direction of flow no dispersion of the adsorbate occurs by such 
irreversible phenomena as diffusion, channeling, or eddy mixing; 

(3) the adsorbent column is homogeneous with respect to porosity and state 
of subdivision throughout its entire length. 
The theory of ideal equilibrium chromatography is very useful since it can 
serve as an excellent starting point in any attempt to develop the more 
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difficult theory of nonideal nonequilibrium chromatographic transport. In 
general, however, this latter type of transport presents considerable mathe- 
matical difficulties. Therefore, the theory of ideal equilibrium chromatography 
serves as a useful approximation to real chromatographic behavior. 

For simplicity, we have described the theory in terms of ‘absolute adsorp- 
tion’, as distinct from ‘‘relative adsorption” (see Baylé and Klinkenberg 
1957a). This does not restrict the theory to any great extent since the concept 
of ‘relative adsorption’’ applies mainly to systems of multiple liquids (see 
Smit and van den Hoek 1957a). In fact, the type of adsorption system we 
are considering has wide limits; the adsorbate may be carried along by either 
a gaseous or a liquid carrier. The only constraint is that the volume of the 
adsorbed phase must be negligible in comparison with the original volume 
of the adsorbent pore space.* 


THEORY OF CHROMATOGRAPHIC TRANSPORT 


In order to express chromatographic transport in the simplest terms, it is 
necessary to define the amount of the adsorbed phase in a unique way. Instead 
of using the dimensions of weight per unit weight of adsorbent as others 
have done, we prefer to use weight per unit volume of adsorbent pore space. By 
making this change, the Langmuir adsorption equation becomes 


| ie 
(1) C ~K+C 


where A, An, A, and C all have the dimensions of weight per unit volume 
of pore space (see List of Symbols at the end of this paper for definition of 
symbols). Both sides of equation (1) are dimensionless. 4, is proportional to 


(2) 


> 1 
(3) a) ; 


for consolidated porous media, e.g., porous rock. Thus the value of A, is 
dependent upon the state of subdivision of the adsorbent, i.e., 2/1, and on 
the packing conditions occurring in the adsorbent column, i.e., ¢. Both m/e 
and p are constant for any one system. The symbol C can be alternatively 
defined in normal concentration units, i.e., weight per unit volume of solution, 
when only one fluid phase is present. 

In a linear column of adsorbent the concentration of adsorbate can be 

*For a cylindrical pore with a free diameter (for flow of, carrier fluid) of 1.0 micron, suppose 
that the adsorbed phase reduces the free diameter by 20 A. Therefore, the cross-sectional area 
of the pore will be reduced by 0.40%. Therefore, when the pore wall is saturated with adsorbate, 
the carrier fluid will flow 0.40% faster. Below saturation the effect will be reduced propor- 


tionately. Therefore, in most real adsorption systems the change in transport behavior caused 
by the volume of the adsorbed phase is negligible. 
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considered to be a function of distance and time. Thus chromatographic 
transport is given by the following material balance equation (compare Klamer 
and van Krevelen 1958, or Baylé and Klinkenberg 1954) 


“ 2), = aa 2), 


Equation (4) can be simplified by introducing some dimensionless variables. 
If R and ¢ are constant, we define 7 as 


(5) + = Q/RL¢. 


Thus 7 is the ratio of the volume of fluid injected into the column divided by | 
the total fluid capacity of the adsorbent column; 7 is a dimensionless time 
and is equal numerically to the number of pore volumes of injected fluids. In 
addition, we define a fractional distance, s, as 


(6) See, 


a 







Substitution of equations (5) and (6) into (4) gives 
‘) acl — 
Os], Or = 
Also, since A is a function of C then (7) becomes 

ac) _ _(ac () | 
- (2) - (3), [1+ sed. 


As stated above, the concentration, C, is a function of distance, s, and time, 
7. Thus 


(7) 






C = F(s, 7) 






and 


~_(9C) , ,(9C 
(9) aC = (as ) as+-(2) ar. 


For the condition where C is constant (i.e., dC = 0), then the ‘‘velocity”’ of 
C is given by 


(10) _ (2). “ -(2) / (8). 


We can remove the condition that R (the cross-sectional area of the adsorbent 
column) must be constant if we change from absolute ‘‘velocity”’ to relative 
velocity, i.e., velocity of a concentration relative to the velocity of the carrier 
fluid. In addition, since A is a function of C only, we can replace (0A/dC),,, 
by dA/dC. The above rationale allows us to modify equation (8) further in 
order to express chromatographic transport in the simplest terms, namely, 
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For the Langmuir adsorption equation (1), the derivative dA/dC has the 
value 
(12) oe ote: 
dC K [1+(C/K)] 

In the above derivation we have assumed that no discontinuities occur in 
C and A. Equation (11), therefore, can only apply to those regions where C 
and A vary in a continuous fashion, e.g., in the trailing edge of a normal 
chromatographic band. However, it is frequently necessary to know the 
transport behavior of concentration discontinuities or shocks, e.g., in the 
sharp leading edge of a chromatographic band. For such discontinuities we 
can use the equation* 


1 
~ 14+[AQ—AwI/Co- Cu] 
where the subscripts (+) and (—) refer to the conditions immediately in 
front of and behind the discontinuity. For continuous changes in C and A, 
the following values exist: 
Co = C, Aw =A, 
Cua = C+dC, Aw = Atda. 





(13) Va 


Substitution of these values in equation (13) gives equation (11), which has 
been derived already. At the sharp leading edge of a chromatographic band 
the values of C and A are 


Ces = C; Av) = A, 
Cu) = 0, Ay = 0. 
Substitution in equation (13) gives 
(14) V,= wines Mite 
‘ 1+(A/C)* 


By means of equations (11), (13), and (14) it is possible to describe the chroma- 
tographic transport of any adsorbate no matter how complex the adsorption 
isotherm may be. Equations (11) and (14) are particularly simple, and they 
can be used easily to make mental calculations of many aspects of a chromato- 
graphic process. 

The argument presented above for a single adsorbate can be extended 
readily to cover multiple adsorbates. Many equations have been derived to 
express the adsorption of a single adsorbate. However, only the Langmuir 
adsorption equation has been modified in a simple, quantitative manner to 
describe multiple adsorbates. Success with this equation is probably due to 
the fact that it has an excellent theoretical basis. Suppose, for example, that 
two adsorbates, 1 and 2, are adsorbed simultaneously. The modified Lang- 
muir equationf is 


*Equation (13) is a rigorous material balance transport equation. Although proof is not 
given here, it can readily be shown to be true. 

tThis equation and the following are given for adsorbate 1 only. Analogous equations for 
adsorbate 2 can be obtained merely by changing subscripts. 
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(15) At a i ae 
Ci K, 1+(C1/K1)+(C2/Ke) 

for adsorbates that have approximately the same free energy of adsorption. 

Equations analogous to equations (11), (12), (13), and (14) also exist. Re- 


spectively these are 




















(16) Veo. = eer i 
) C1 1+ (8A1/8Ci)c.-+ (8A1/8C2)c,° (8C2/8Ci) ee’ 
17)" (2) dey | - hares 
0C:/c, Ki [1+(Ci/K:1)+(C2/K2)]°’ 
(18)* es) a lai ‘Ke 
9C2/ ¢, Ky, [1+ (Ci/K1)+(C2/K2)]°’ 
19 1. Seonceietenntiticie hacen athdaa 
( ) di 1+ [414 —Ar)/[Ciaa — Cray] 
(20) Ve ! 


14+(4i/Ci) 
The evaluation of (8C2/AC,);,- in equation (16) has caused controversy in 
the literature. Briefly, it can be stated that 

(a) if some definite relation, independent of s and 1, exists between C, and 
C2, viz. 


(21) Ci = F(C2), 


then (0C2/8C1);,, can be replaced by dC2/dC, which, in turn, can be evaluated; 
(b) if no relation exists, then (8C2/0C,),,, cannot be evaluated analytically 

and furthermore its value will be different at each particular combination of 

s and r. 

If equation (21) is valid, then a particular value of C; is always associated 

with a particular value of C2. These values of C; and C2 are always together, 

and they have the same velocity. Therefore, 


















Ve = Veo 


as os C2 _ ee fea) dy 
8Ci/ ce \OC7/ ¢, dC, \OC7S ce, \OCV c, dC’ 











=—— wba ~ : paar 
2 a3 7 | Ne af24s | = (2) 
| (2 C2 dC» JV dCi C2 IC» ¢, ass. 0C2 C1 ac, C2 
2( 24s) 
* dC2 C1 


Equation (22) shows that dC./dC,; has two values, one positive and one 
negative. 












*Equations (17) and (18) together are analogous to equation (12). 
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CHROMATOGRAPHIC TRANSPORT WHEN VELOCITY IS CONSTANT 


The velocity equations that have been derived above can be used to deter- 
mine the position of a concentration at any specific time. For instance, if a 
concentration entered the column at time 71, then its position, s, at time 72 
is simply 
(23) s = V(r2—7}). 


Of course, equation (23) is only valid when the velocity is constant. In a zone 
where the concentration changes smoothly and continuously, the velocity 
Ve is given by (11) and is constant throughout the column as long as the 
particular concentration, C, exists. Also, if the concentrations on both sides 
of a discontinuity are constant, then the velocity of the discontinuity, V4, as 
given by equation (13), is constant. Similarly, for the sharp leading edge of a 
chromatographic band, the velocity V; (see equation (14)) is constant as long 
as the concentration at the front is maintained at the injection concentra- 
tion, C;. Figure 1 illustrates an early stage in a chromatographic transport 
process* wherein a slug of adsorbate solution is transported by flowing solvent 


ADSORBATE 
ADSORBED 


Ci 


ADSORBATE 
IN SOLUTION 


AMOUNT OF ADSORBATE, Cand C+A 
(Weight per Unit Volume of Pore Space) 





.00 .04 .08 l2 16 .20 
RELATIVE DISTANCE, S 


Fic. 1. Adsorbate distribution in a transported chromatographic band at 7 = 0.20. 


*For this process the following numerical data apply. 
(a) The adsorption isotherm follows the equation: 
A _ An 1 _* is eae 
C K 1+(C/K) 1 °1+(C/l) 14C€ 
(b) Volume of adsorbate solution = 0.10 p.v. 
(c) Concentration of injected adsorbate solution = 6.00 weight units per unit volume of 
pore space. 
(d) Weight of adsorbate injected is 6.00 0.10 = 0.60 weight units per unit volume of pore 
space. 
(e) The leading edge and the trailing edge of the adsorbate solution enter the adsorbent 
column at 7 = 0.00 and r = 0.10, respectively. These edges are abrupt discontinuities when 
they enter the column. 
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through an adsorbent column. The curved line a@ shows the concentration in 
the trailing edge versus distance, whereas line 8 shows the total adsorbate 
distribution. At the time stage shown in Fig. 1, the concentration C, in the 
trailing edge has not yet caught up with the leading edge, i.e., a plateau 
region still exists. Therefore, the front still has a constant velocity. However, 
at some later time stage, the plateau region will vanish, the concentration 
at the front will decline below C,, and the velocity of the front will gradually 
diminish, 


CHROMATOGRAPHIC TRANSPORT WHEN VELOCITY IS VARIABLE 


The position of a concentration at time 72 can be determined very easily 
by equation (23) provided that the velocity 1s constant up to the time r2. However, 
if this criterion cannot be met then the determination of position can be quite 
difficult unless the proper techniques are used. Several authors, e.g., Weiss 
(1943), Glueckauf (1947), have dealt with this particular facet of chromato- 
graphic theory. However, the treatment has been limited in scope, erroneous 
in some cases, and always more complex than necessary. In this section we 
shall discuss several general methods for handling the transport of discon- 
tinuities when velocity is variable. These methods are simple but accurate, 
and they illustrate very clearly the transport of a chromatographic band. 

The first method is graphical and is illustrated in Figs. 2 and 3. 





(Weight per Unit Volume of Pore Space) 


RELATIVE DISTANCE, S 


AMOUNT OF ADSORBATE, C and C+A 


Fic. 2. Adsorbate distribution in a transported chromatographic band at 7 = 0.60. 


1. At the time stage 7, calculate the position, s, of each concentration, C, 
in the continuous trailing edge of the chromatographic band by means of 
equations (11) and (23). Plot s versus C to give line @ as in Fig. 2. 

2. Calculate A for each C by equation (1) or some other analogous adsorp- 
tion equation. Plot s versus C+A to give line 6 as in Fig. 2. 

3. Using a planimeter, place a vertical front at a value of s so that the 
area enclosed by 8 and the front represents the weight of adsorbate (per 
unit volume of pore space) originally injected into the adsorbent column. 


This determines the position of the front at time 7. 
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Fic. 3. Time-distance diagram of concentrations in a transported chromatographic band. 


4. Repeat steps 1 to 3 above for several different time stages and plot all 

associated values of (s, 7) in Fig. 3. Join these plotted points with a smooth 
curve. 
The solvent portion of the slug of injected adsorbate solution travels at the 
velocity of the carrier fluid, i.e., V = 1.00. Thus its transport is given by the 
straight line labelled ‘‘solvent front” in Fig. 3. The transport of the continuous 
set of concentrations in the trailing edge is represented by a fan of straight 
lines (since velocity is constant) emanating from [7 = 0.10, s = 0.00]. The 
movement of the leading edge is given by the heavy line in Fig. 3. Up to the 
point e, the leading edge has a constant peak concentration C; and a constant 
velocity. Therefore, its transport is represented by the straight line segment 
y. After 7., the concentration at the front declines below C;, the velocity 
diminishes, and the transport is represented by the downwardly curving line. 

The transport behavior of a simple chromatographic front (with no adsorb- 
ate ahead of the front) has been illustrated above by use of the graphical 
method. It is also possible to use the method of Johnson (1960) for simple 
fronts, and this will be discussed shortly. However, for concentration dis- 
continuities with adsorbate on both sides of the discontinuity, the graphical 
method of determining position is the only one which can be used. An illus- 
tration of this is given in the following publication. 

Following Johnson’s method, the relation between the weight of adsorbate 
per unit volume of pore space, W, that is injected into the column and the peak 
concentration, C, that reaches the end of the column is given by 


: C/K) | 
1+(C/K) 
However, if the peak concentration, C, is to reach only some distance s then 
we have 





1608 CANADIAN JOURNAL OF PHYSICS. VOL. 39, 1961 


ae iy | ee 
(25) W=~s sal | ; 


It is advantageous to rearrange equation (25) to give 
a ee T 

26 eS Oe pee 1B, 

i w L (C/K) 


Figure 4, which is a logarithmic plot of C/K versus (A/W)-s, illustrates the 
decline of the peak concentration at the leading edge of a chromatographic 
band as the adsorbate is moved through the adsorbent column by the action 
of the carrier fluid. The abscissa is an adjustable distance scale and the 
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Fic. 4. Peak concentration versus distance diagram for the leading edge of a transported 
chromatographic band. 


ordinate is an adjustable concentration scale. In the numerical example given 
in this paper, (4,,/W)-s equals 6.67, when s = 1.00. A vertical line at this 
value intersects the curve at C/K = 0.632. Since K = 1.00, we obtain the 
result that the peak concentration at the end of the column will be 0.632. The 
vertical line also intersects a horizontal line drawn at the injection concen- 
tration, i.e., C; = 6.00. If the same log scale as the one used for the abscissa 











TUDGE: CHROMATOGRAPHIC TRANSPORT. I 1609 


is placed along this horizontal line, we obtain distance directly as a percentage 

of the total column distance. Figure 4 shows that the injection concentration 

is maintained until the leading edge has gone 20.4% of the column length. 

Thereafter the peak concentration declines according to the curve. 
DISCUSSION 

In Figs. 1 and 2 the complete adsorbate distribution—adsorbed, in solution, 
and total amount—is clearly shown. This method of presentation has advan- 
tages. It is consistent with material balance, and it allows the position of the 
adsorbate leading edge to be determined when the peak concentration is below 
the injection concentration. Other authors have failed to give a complete and 
accurate picture. DeVault (1943), Sillén (1950), and Klamer and van Krevelen 
(1958) showed only the distribution of the adsorbate in solution in their dia- 
grams; Wilson (1940), Weiss (1943), and Glueckauf (1946) showed only the 
amount adsorbed. Glueckauf (1947) showed the total adsorbate distribution but 
did not indicate the components which contribute to this total. Moreover, 
his diagrams showed the adsorbate distribution in the trailing edge by curves 
which were concave downwards. This was misleading in certain respects 
because Figs. 1 and 2 (this paper) clearly show that curve 8 can have an 
inflection point during the early stages of the chromatographic development 
when the adsorbate is near the inlet end of the column. 

The transport equations given in this paper are easy to use. For instance, 
if the relative velocity, Vc, of a concentration is 1/[1+(dA/dC)], then it is 
obvious that 1+(d4/dC) pore volumes of carrier fluid will be required to 
move the concentration from the start to the end of the column. The value 
of dA/dC is given by the slope of the tangent to the adsorption isotherm at 
the concentration C. Similarly, in equation (14), A/C is the slope of the 
chord to the isotherm between the origin and the point where concentration 
equals C. 

Figures 3 and 4 are valuable aids in understanding the chromatographic 
transport process and in making rapid calculations. 


LIST OF SYMBOLS 


A = Weight of adsorbate contained in an incomplete mono- 
layer upon those surfaces of the adsorbent which are 


associated with unit volume of pore space. [ML-?] 
Am = Same as A except a complete monolayer. [ML-3] 
C = Concentration of adsorbate (weight per unit volume 
of pore space). [ML-*] 
C; = Concentration at which the absorbate is injected into 
the adsorbent column. [ML-3] 
K = Constant in Langmuir adsorption equation (weight 
per unit volume of pore space). [ML-3] 
L = Length of adsorbent column. [L] 
p.v. = Abbreviation for pore volume. (Note: by definition, 


the adsorbent column contains one pore volume of 
available pore space.) [dimensionless] 
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Q = Total volume of all fluids injected into the adsorbent 
column. ({L?] 
R = Cross-sectional area of adsorbent column. [L?] 

s = Fractional distance (distance from point of injection 
divided by length of adsorbent column). [dimensionless] 

V = Velocity of the adsorbate divided by the velocity of 
the carrier fluid. [dimensionless] 

W = Weight of adsorbate per unit volume of pore space 
that is injected into the column. [ML-*] 
x = Distance from point of injection. [L] 
p = Density of a solid particle of adsorbent. [ML-] 
P = Bulk density of consolidated porous media. ([ML-3] 

m/o = Weight of one molecule of adsorbate divided by the 
area of adsorbent surface covered by one molecule. [ML-2] 
~/M = Surface area per unit weight of adsorbent. [L?M-?] 
7 = Dimensionless time (r = Q/RL@). [dimensionless] 

¢@ = Fractional porosity (volume of the pores divided by 
the total volume of the adsorbent column). {dimensionless} 
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STUDIES IN CHROMATOGRAPHIC TRANSPORT 
II. THE EFFECT OF ADSORPTION ISOTHERM SHAPE! 


A. P. TuDGE 


ABSTRACT 


When an adsorbate with a complex adsorption isotherm is transported 
chromatographically, concentration shocks, either stable or unstable, can 
occur. Methods are given for determining the transport behavior of each type 
of shock. Mistakes in the literature are corrected. 


INTRODUCTION 


Various authors have published on this particular facet of chromatography. 
Both DeVault (1943) and Weiss (1943) realized that the shape of the adsorp- 
tion isotherm has a major influence upon the chromatographic transport of an 
adsorbate. They discussed simple isotherms, i.e., those with no inflection 
points, such as the Langmuir and the Freundlich as well as those that are 
either linear or convex upward. Although their treatments of simple isotherms 
were qualitatively and quantitatively adequate, their mathematical equations 
describing transport were fairly complex. A simple but accurate treatment of 
the theory of adsorption chromatography was given by Tudge in Part I of 
this series on chromatographic transport. 

The next advance was made by Glueckauf (1947). He described the influence 
of a complex isotherm, i.e., one with a point of inflection, upon chromato- 
graphic transport. As will be discussed in this paper, Glueckauf’s treatment 
was only qualitatively correct. Certain of his equations and conclusions which 
relate to the velocity of concentration discontinuities were erroneous. Klamer 
and van Krevelen (1958a) also described the influence of a number of complex 
isotherms found in ion exchange (Klamer, Linssen, and van Krevelen 1958; 
Klamer and van Krevelen 19580). Their publication (1958a@) repeated the 
error of Glueckauf (1947) and, in addition, contained a number of mistakes 
and omissions due, no doubt, to poor proofreading. 


CHROMATOGRAPHIC TRANSPORT OF AN ADSORBATE WITH 
A COMPLEX ISOTHERM 

In Part I (Tudge 1961), Tudge has given a simplified treatment of the 
theory of adsorption chromatography. The relative velocity* of transport was 
given by 
— — 1 —— = 

1+[A-Aw)/ACo- CoH] 
where the subscripts (+) and (—) refer to the condition immediately in front 
of and behind some plane perpendicular to the flow axis. Equation (1) is 





(1) Va 


‘Manuscript received May 8, 1961. : 
Contribution from the California Research Corporation, La Habra, California, U.S.A. 
*Velocity relative to the velocity of the carrier fluid. 


Can. J. Phys. Vol. 39 (1961) 
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directly applicable to concentration discontinuities. If the discontinuity is a 
simple chromatographic front, i.e., with no adsorbate ahead of the front, then 
equation (1) reduces to 
1 
2 Veo" = 2 
@) ‘"TF4/0) 


However, equation (1) can also apply to regions where the concentration 
changes gradually and continuously, and then it reduces to 


: 1 
(3) Ve=THGA/dO ° 
By means of equations (1), (2), and (3), it is possible to describe the chromato- 
graphic transport of any adsorbate no matter how complex its adsorption 
isotherm may be, provided, of course, the assumptions discussed in the pre- 
vious publication apply. In order to illustrate this point consider the hypo- 
thetical, but not improbable, isotherm shown in Fig. 1. 
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Fic. 1. Hypothetical complex isotherm. 





O = Origin point of isotherm (Cy = 0). 
G and H = Points of inflection (d?4y/dCy? = d?Ag/dC,? = 0). 
J = Point where tangent and chord to isotherm have equal slope 
(Ay/Cy = dA;/dCy). 
E = Point where chord through J meets isotherm again (4 j5/C ; = 
Ax/Cz). 
Band N = The same line is tangent to isotherm at both B and N, 
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The isotherm of Fig. 1 has a different shape than the ones discussed by Gluec- 
kauf (1947) and by Klamer, Linssen, and van Krevelen (1958); in addition it 
is more complex than theirs. These differences are immaterial, however, since 
the principles to be discussed can be applied to any isotherm. At the outset, 
it is very important to distinguish between ‘‘stable’’ concentration discon- 
tinuities where the concentrations on both sides of the discontinuity are con- 
stant, and “unstable” discontinuities where the concentrations are varying 
with time. We shall discuss the former case first. Stable situations can occur 
when a plateau region of concentration exists, e.g., the concentration of the 
plateau equals the injection concentration. Let us start by discussing the 
shape of the leading edge of a chromatographic band. The exact shape will 
depend upon the injection concentration. 

1. For that portion of the isotherm between O and J, A/C becomes less at 
higher values of C and yet A/C is always greater than dA/dC. Therefore, a 
vertical front will form at the leading edge, and the velocity of this front can 
be determined from equation (2). A diffuse region of high concentration cannot 
lag behind this front since V¢ of equation (3) is greater than V; of equation 
(2). At the same time these high conceritrations cannot be projected ahead of 
this front as this would be physically impossible. A diffuse region of low 
concentration ahead of this front would travel more slowly than the front 
and, hence, would be enveloped. The net effect is that the front is self-sharp- 
ening. 

2. For injection concentrations above Cy, the shape of the leading edge is 
slightly different. The slope of the chord, A/C, which had been decreasing, 
starts to increase. More important dA/dC is now greater than A/C, and 
therefore V¢ is less than V;. Because of this the front now has the following 
shape: a sharp vertical (discontinuous) front between Cy and Cy with velocity 
given by equation (2) topped with a diffuse (continuous) region with velocity 
given by equation (3). This diffuse region trails behind the sharp front. 

3. At Cy and above, the shape of the leading edge is again different. The 
tangent to the isotherm, dA/dC, begins to decrease. This causes a concen- 
tration step (discontinuous) to form above the diffuse region described in the 
preceding paragraph. The concentration at the top of the step is designated 
C\_) (see equation (1)) and this is equal to the injection concentration; thus 
C\_y is fixed. The concentration at the base of the step is designated C,4) 
since it is ahead of the discontinuity. For stable situations this step is neither 
advancing on nor receding from those concentrations immediately lower or 
higher than C,4). Thus Ci4) is fixed also. C;_) can be any (chosen) value 
between Cy and Cg depending on the injection concentration. C,,) is tem- 
porarily unknown but it can be determined by realizing that C,,) must also 
have the velocity given by equation (3). Therefore 
(4) Aa~Aw _ dAw 
; Co-Cuy = dC) 


Equation (4) can be solved for C,,) provided the exact functional relation 
between A and C is known. When this relation is very complex, as is the case 
in Fig. 1, we can use a trial and error method and try various corresponding 
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values of Cy4), Aw, and dAq)/dCy) until the equality of equation (4) is 
satisfied. The correct set of values can be found in two or three trials. Alter- 
natively, a tangent can be drawn to the isotherm from [C;_), A:_)]. The point 
of tangency is [C(4), Ay], and the slope of the tangent is dA(4)/dCy). By 
any of these methods it is found that for Ck > Ci) > Cu, Cy < Cay < Cu. 

4. For an injection concentration of Cg, the step (paragraph 3. above) will 
have the same velocity as the front (paragraph 1. above). In other words, a 
single discontinuous front will exist from Co to Cg. Also for concentrations 
above Cg a single front will occur and its velocity can be found from equa- 
tion (2). 

In similar fashion let us study next the shape of the trailing edge for stable 
situations. 

5. For injection concentrations between Cy and Cg, a diffuse (continuous) 
region exists at the trailing edge. The velocity is given by equation (3). 

6. Point G on the isotherm of Fig. 1 is an inflection point; thus concen- 
trations above Cg would tend to travel slower than those below Cg. This 
causes a concentration step to occur above the diffuse region of paragraph 5. 
The concentration at the top of the step is now designated C(4), since it is ahead 
of the discontinuity. C(,) is equal to the injection concentration and C,_) can 
be evaluated from an equation similar to (4). Thus for Cy > Cy) > Ce, 
Cp < Cy) < Cg. Once Cy_) and C4) are known, the velocity of the step can 
be calculated from equation (1). 

7. For injection concentrations above Cy, a diffuse (continuous) region 
exists ahead of the step described in paragraph 6. 

Figure 2 provides an excellent quantitative summary of the above discussion. 
If, for example, the injection concentration is slightly less than Cy, the velo- 
cities (and, hence, the shape) of the leading and trailing edges are given by the 
dashed lines in Fig. 2. 

The transport of unstable discontinuities requires a different treatment. 
Since the concentrations on both sides of the discontinuity, or even just one 
side, vary with time, equation (4) is no longer valid and the data of Fig. 2 
are no longer completely applicable. In fact, the velocity of the discontinuous 
step is different from the instantaneous velocity of either Ci4) or Ci). In 
addition, the velocity of the step is changing continuously. This makes it 
virtually impossible to describe the position* of a discontinuity for an unstable 
situation by any simple mathematical expression. The only simple way to 
determine position is by a graphical method based on material balance. This 
method was outlined in the previous paper by Tudge. In order to illustrate 
how this method applies to an adsorbate with a complex isotherm, let us 
consider a process wherein a small slug of adsorbate solution is transported 
by flowing solvent through a linear bed of adsorbent. The following data are 
assumed to apply. 

(a) The functional relationship between A and C is that shown in Fig. 1. 


*When velocity is constant, the position of a concentration or a discontinuity is given by 
the simple equation: 


(5) 


s = V(re—n1). 











oo 


scoeelanadainehieeieennaniaennamannmamnnaamaametimaimneten memmaamntanemmammamenteienanemmm metre mance Tae 





TUDGE: CHROMATOGRAPHIC TRANSPORT. II 1615 





7.0 
6.0 
(2) 

s Cey 3 
°3 5.0 E 
-— ° 
oe 
i ' 
= $ (1) Cus 
e* 4.0 
Es - -- 
25 = | 

2 : I 
S$ 3.0 (3) S| (1) 
w S LEADING EDGE TRAILING EDGE 
a & 2.0 | | 
=~ se 

4 | S 
82 (2) 
as IY 
“> 16 BS 

Ss 
Co a ES | 
0.0 = 
00-10 20 30 “40 50  .60 70 80. 90 100 


RELATIVE VELOCITY, V 


Fic. 2. Velocity of concentrations for a stable situation. 


(6) Volume of adsorbate solution = 0.10 pore volume. 

(c) Concentration of injected adsorbate solution = 6.00 weight units per 
unit volume of pore space. 

(d) Weight of adsorbate injected is 6.00X0.10 = 0.60 weight units per 
unit volume of pore space. 

(e) The leading and trailing edges of the adsorbate solution are abrupt 
discontinuities when they enter the column at 7 = 0.00 and 7 = 0.10 re- 
spectively. 

Figure 3 shows the adsorbate distribution (both in solution and adsorbed) 
at the instant when the injection of the slug of adsorbate solution is com- 
plete. Next, pure solvent is injected into the column. At the time stage of 
Fig. 4, the plateau region is still partially intact, and, thus, the data of Fig. 2 
can be used to determine the position of the concentrations. Eventually C, in 
the trailing edge will catch up with C, at the front. It can be determined 
easily by the use of equation (5) that this event occurs at 7 = 0.256 and 
s = 0.131. Hereafter, the peak concentration at the front declines from C, to 
Cy and the velocity decreases. A single front is maintained during this stage. 
It is important to determine the time and place when Cg in the trailing edge 
reaches the front. This can be determined in the following way: 

(a) At r = 0.256 the plateau region has just disappeared and Cg, in the 
trailing edge is at s = 0.116. 

(6) Based upon values of A;/C; and Ag/Cg, the maximum and minimum 
times at which Cx, will be at front are r = 0.319 and r = 0.312. 

(c) The exact time can be determined between the above limits by a trial 
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Fic. 3. Adsorbate distribution in a transported chromatographic band at + 
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and error process. It was found that the proper requirements, i.e., material 


balance and a peak concentration of Cg at the front, were met at rt = 0.315 
and s = 0.161. Figure 5 shows this time stage. 

Below a peak concentration of Cy the shape of the front becomes more com- 
plex. All time stages require the use of Fig. 5 as a starting point in determining 
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the position of the various parts of the leading edge. In Fig. 6, at r = 0.600, 
the positions of the following concentrations or discontinuities can be deter- 
mined by using equations (2), (3), and (5): 
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Fic. 6. Adsorbate distribution in a transported chromatographic band at r = 0.600. 


In the trailing edge: (i) continuous set of concentrations from Co to Cz, 
(ii) discontinuous concentration step from Cx to Cy, 
(iii) continuous set of concentrations above Cy (dashed 
line). 
In the leading edge: (i) discontinuous front from Co to Cy, 
(ii) continuous set of concentrations above C; (dashed 
line). 
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Fic. 7. Adsorbate distribution in a transported chromatographic band at r = 0.830. 
Fic. 8. Adsorbate distribution in a transported chromatographic band at r = 0.900. 
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The only unknown remaining in Fig. 6 is the position of the unstable dis- 
continuous concentration step between the two dashed lines. There is no 
easy mathematical way to find this. The only way is to use the graphical 
method based on material balance, and to find the position of this step by 
trial and error. The operational details of this method were described in the 
previous publication. As the chromatographic process continues, this unstable 
concentration step approaches closer to the stable concentration step from 
Cz to Cy. When they meet, the unstable step vanishes and the stable step then 
itself becomes unstable. Figure 7 illustrates this situation. With time this 
new unstable step in the trailing edge catches up with the front (Co to C3). 
Then the situation of Fig. 8 occurs. 

DISCUSSION 

In the Klamer and van Krevelen (1958a) publication, Figs. 6(0), 6(c), 8(d), 
and 8(c) representing unstable chromatographic situations are not correct 
because they are based erroneously upon an equality (see their equation (5)) 
similar to our equation (4), which, of course, is only valid for stable situations. 
Glueckauf (1947) made the same mistake. The equality expressed by his 
equation (16) is not true since the chromatographic discontinuity under 
consideration is not stable. 
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VALUES OF THE TRANSPORT COEFFICIENTS IN A PLASMA 
FOR ANY DEGREE OF IONIZATION BASED ON 
A MAXWELLIAN DISTRIBUTION! 


I. P. SHKAROFSKY 


ABSTRACT 


Values are presented for the electronic conductivity for any degree of ionization, 
radio frequency, and d-c. magnetic field strength, and various electron speed 
power law variations of the electron collision frequency with neutral particles. 
Also, the other transport coefficients, such as electron current due to electron 
density gradients and temperature gradients, and energy flow due to an electric 
field and due to density gradients and temperature gradients, are tabulated. 

The analysis is based on substitution of the usual series expansion of Laguerre 
polynomials into the Fokker—Planck equation for Coulomb collisions and into 
the Boltzmann equation for electron collisions with neutral particles. For Cou- 
lomb effects, the expressions are the same as those derived by Landshoff. Collisions 
of electrons with neutral particles are included in addition to ions, and a-c. electric 
fields are treated as well as d-c. magnetic fields. In the limit of a completely 
ionized gas, the results also agree with those of Spitzer and Harm and of Kauf- 
man. For a slightly ionized gas, the results are compared with Allis’ treatment 
and with calculations using the Dingle integrals. It is found that the Laguerre 
convergence is inadequate for large angular frequencies when the power law is 
less than —2 and for small angular frequencies when the power law is greater 
than 1. 

The final results can be put in a form which yields two factors, multiplying, 
respectively, the average collision frequency and radian frequency, to give correct 
results from simple equations. These factors are usually of order one, and are 
functions of three parameters, proportional to angular frequency, ratio of electron— 
neutral to ion averaged collision frequency, and ion charge number. 


1. INTRODUCTION 


The basis of any calculation of the electromagnetic properties of a plasma 
rests upon a knowledge of its electrical conductivity. The conductivity of a 
slightly ionized gas is treated adequately by Allis (1956). In this limit of 
negligible degree of ionization, it is sufficient to consider in Boltzmann’s 
equation only electron collisions with neutral particles in order to derive values 
for the total collision frequency. The frequency for Coulomb collisions is 
negligible in comparison. For the opposite limit of a completely ionized gas, 
Spitzer and Harm (1953) use the Fokker—Planck form for the collision term 
and evaluate the d-c. conductivity by numerical integration. A different method, 
adopted by Cowling (1945), Chapman and Cowling (1958), Landshoff (1949), 
and Kelly (1960), is to expand the collision term in Boltzmann’s equation as 
a product of Legendre polynomials for the angular dependence and of Laguerre 
polynomials for the speed dependence in the electron-distribution function. 
Cowling and Kelly provide the theory which includes electron collisions with 
neutrals, ions, and other electrons and which can include a d-c. magnetic field 
as well. The only complete calculations are those of Landshoff, who evaluates 

1Manuscript received May 23, 1961. 
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the d-c. conductivity in the presence of a magnetic field for a completely 
ionized gas. Gurevich (1959) states baldly that Landshoff’s results can be 
applied to the case of radio frequencies; this will be proved here. Cohen, 
Spitzer, and Routly (1950), however, criticize the Boltzmann expansion 
approach, on the grounds that “the theory is in fact not well suited to handle 
inverse-square forces between the particles in a gas and the accuracy of the 
results obtained is uncertain’. They claim that the Fokker—Planck equation 
applicable to the long-range Coulomb forces should be used instead of Boltz- 
mann’s equation, and since the convergence of the expansion method is not 
known, they integrate their resulting equation numerically. Both of these 
criticisms are investigated here. 

The basic equations used here (Bachynski et al.) result from the application 
of the Fokker—Planck equation for Coulomb collisions and the Boltzmann 
form of the collision term for electron collisions with neutral particles. The 
angular dependence in the equations is expanded as in Spitzer and Allis’ 
analyses in Legendre polynomials. One scalar and one vector integrodiffer- 
ential equation result, linking fo (the isotropic part of the electron distribution 
function) and f, (the directional part of the distribution function). These 
equations are then applied to the subsequent analysis. For calculation purposes, 
fo is taken to be a Maxwellian distribution. This satisfies the scalar equation 
if the forces (electric field, etc.) are small. 

It will be shown that if, in order to solve the vector equation, the Laguerre 
polynomial expansion is inserted, then the results are identical with those of 
Landshoff for Coulomb collisions, including electron-electron collisions. (This 
is apart from the known fact (Bachynski e¢ a/.; Delcroix 1959) that for electron 
collisions with heavy particles, including ions, the results from Boltzmann 
and from the Fokker—Planck equations agree.) That is, both approaches under 
similar expansions lead to the same results. This result may have been expected, 
since the assumptions of a Markovian process and of binary collisions in the 
standard Fokker—Planck equation reduce the physical concepts to essentially 
those valid for Boltzmann’s form of the collision term. Note, however, that 
the expansion in Laguerre polynomials (of order 3/2) is only one method of 
solving the complicated relations. There is, a priori, no reason why a different 
order of Laguerre polynomials or a different polynomial function may not be 
more suitable for some particular cases. The relations derived from the Fokker- 
Planck equation without the use of the Laguerre polynomial expansion are 
thus important for future attempts at more rapidly convergent expansions or 
for numerical integration. 

In this paper, the standard Laguerre polynomial expansion is used with fo 
as a Maxwellian distribution. The conductivity is calculated as a function of 
three parameters, which are proportional to angular frequency, degree of 
ionization, and ion charge number. Seven variations with velocity of the 
electron collision frequency with neutrals are considered,* proportional to the 
rth power of velocity where r = —3, —2,...3. The convergence of the 


*The variation of electron collision frequency with velocity in various gases is given in 
Collision Frequency Associated with High Temperature Air and Scattering Cross-Sections of 
the Constituents by I. P. Shkarofsky ef al., published in Electromagnetic Effects of Re-entry 
Pergamon Press, 1961. 
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method is tested by comparing the calculations with the exact solutions for 
a feebly ionized gas. The exact values can be obtained by using the Dingle 
integrals (1957). It will be shown that the convergence is good for r = —2, 
—1, 0, 1 and good in some ranges of parameters for the others. These calcu- 
lations seem to be the first complete set that include neutral encounters whose 
collision frequency varies with velocity, ions, and electrons, as well as radio 
frequency and d-c. magnetic fields. Furthermore, all calculations are per- 
formed on a 4 by 4 matrix, one order of expansion higher than that of Landshoff 
and two orders higher than that of Cowling. 

A new representation is invented to present the results in a normalized 
manner suitable for both d-c. magnetic field and radio frequencies. Two 
functions are defined which multiply the averaged collision frequency and the 
(w+w,) angular frequency in the denominator of the simple conductivity 
relation. At high angular frequencies, both functions approach 1 for all 
cases. At low angular frequencies, they approach various limits depending on 
the ratio of averaged electron—neutral to electron-ion collision frequency, the 
effective ion charge number, and the variation of electron—neutral particle 
collision frequency with velocity. Gurevich (1957, 1959) has used two different 
functions for the same purpose in the extreme cases of slight and complete 
ionization. The representation given here for any degree of ionization is 
simpler and more appropriate to investigations of the electromagnetic properties 
and of the Appleton—Hartree equation. 

The refractive index for a plasma in which an electromagnetic wave propa- 
gates at an arbitrary angle with respect to a d-c. magnetic field, is given by 
the Appleton—Hartree equation. This equation has been generalized in various 
applications (Majumder 1937; Jancel and Kahan 1954, 1955; Sen and Wyller 
1960; Shkarofsky and Johnston 1960) to account for the velocity variation 
of the electron collision frequency with neutral particles in a feebly ionized 
gas. Using the analysis in this paper and the two functions derived herein, the 
Appleton—Hartree equation can be further generalized to apply to any degree 
of ionization and ion charge number. These results will be published elsewhere. 

In addition to the electron current due to electric fields (related by the 
electronic conductivity), the same analysis is used to yield the electron currents 
due to temperature and density gradients, and the energy or heat flow due 
to electric fields, temperature gradients, and density gradients. Thus, quantities 
such as diffusion constants and thermal conductivities are derived. All these 
transport parameters are also considered by Landshoff (1949)* for a com- 
pletely ionized gas and by Sodha (1959, 1960) for a slightly ionized gas. They 
are tabulated here for any degree of ionization. 


2. BASIS OF ANALYSISf 
The basic assumptions in the analysis are as follows: 
(a) Only the electron current and heat flow are considered; the contribu- 
tions due to ion motions and currents are not considered. 
*To eliminate pressure gradients, Landshoff lets part of the temperature term cancel the 


diffusion term. These two source terms are considered independently here. 
tThis section is intended only as a summary of known relations. 
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(6) The temperature of the plasma is assumed sufficiently low that thermal 
plasma oscillations are not important. Also the a-c. charge density in the 
plasma is neglected. 

(c) Only elastic collisions of electrons with other particles are included.* 

(d) A steady state is assumed. 

(e) The angular dependence of the distribution function can be expanded 
in Legendre polynomials and the convergence is such that all terms but those 
of the first two orders are negligible. 

(f) The collision frequency is a function of velocity only and not of position, 
that is, the scatterers are uniformly distributed. 

(g) Any applied electric or magnetic fields are uniform in space. 

Further assumptionsf arise from the application of the Fokker—Planck 
equation to Coulomb interactions: 

(h) Only a Markovian process of interaction with no correlation is implied. 

(7) Time ensemble averages of only binary collisions between charged par- 
ticles are included in spite of the long-range forces involved. The justification 
of these assumptions is well known and can be found in the many papers on 
this subject. Extensions (Tchen 1959) of the theory, which eliminate these 
assumptions, are beyond the scope of this analysis. 

(j) For the calculations, it is assumed that the sources (electric field and 
gradients) are sufficiently small not to disturb the equilibrium Maxwellian 
distribution. That is, the isotropic part fo of the electron distribution function 
is taken to be Maxwellian. In all this analysis, except in the final calculations, 
fo is only restricted to a form in which the velocity dependence (normalized 
to the average energy, say) can be factored out from the other dependences, 
such as electric field, excitation potential, etc. The velocity dependence of the 
electron collision frequency with neutrals is also assumed to be separable. 

If the electron distribution function f is expanded thus, 


(1) f = fotfiev/o+fieve’/v+..., 
then, subject to the above assumptions, Boltzmann’s equation 


(2) of/ dt+v ‘Vif+ (a +v Xp) Vf — (5f/6t) coutsion 
oe (8f/5t) en + (6f/5t) es + (6f/5t) cc 


yields the following results (Allis 1956; Bachynski e¢ al.) 


(3a) vy, ty ‘(a f1-+5 Re(a’s 4) | 


3 
5 kT, afi (#8) (#) 
etm "mM m (q4H%e 2) | bt 7 dt / ee 


(30) vv fota' (dfo/dv) +e Xft = —rmfit (6f1/6t) it (561/50) c6, 
(3c) joof +a’ (dfo/dv) +oy Xfi = —vmfi+ (5f1/5t) t+ (5£1/5t) ec, 


*Inelastic collisions can readily be included in the fp equation. They do not influence the 


f, contribution. 
{These assumptions are required for the validity of the Boltzmann equation as well. 
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where, considering neutrals and ions of a single gas, (6f/8t)contsion is the 
change in the distribution function due to collisions. Only the electron—neutral 
(en) contribution is written down explicitly above as y,. Also 


(3d) a = a°+ale’! = eE°/m-+ (eE1/m) e**. 


E® is the d-c. electric field and E! is the a-c. electric field, 
w is the applied angular radio frequency, 


(3e) @, = eB/m, 


B is the d-c. magnetic field, 

e, m, v are the electronic charge, mass, and velocity, respectively, 

M is the mass of the heavy particle colliding with the electron, 

Ym(v) is the electron collision frequency for momentum transfer with the 
heavy neutral particles and is a function of velocity, 

Vv, and VY, are, respectively, space and velocity gradients, 

T, is the temperature of the gas (i.e. heavy particles), 

k is Boltzmann's constant, 

t is the time, and 

j=v-1. 

The electron-ion (ei) contribution can be obtained either from the Fokker- 
Planck equation or directly from the Boltzmann equation, with the result 


of) _ kT, af ] 
(4a) (#) =3 — |" re if m (2440 mv dv fs) : 
(46) (##) 


where (Bachynski et al.) 


(4c) Yer = 14Ve/0° = ny4a(Ze’/4rem) In A/v’, 


Sine Ze’ ) a ee (AmeokT)*” 
(4d) Ya _— te( 2 In A and A — 2Ze° (an)'/* ’ 


and n,, m are the ion and electron density, respectively, 
T is the electron temperature, 
ve is the electron-ion elastic collision frequency, 
Z is the charge number on the ion, 
€9 is the dielectric constant of free space. 

Upon comparing eqs. (4a) and (46) with the right-hand sides of (3a) and 
(36), one sees that the collision terms for electron-ion and electron—neutral 
particle collisions are identical in form with appropriate electron collision 
frequencies. In fact, the two contributions can be combined and one can 
write the sum of the electron collision frequencies as 


(4e) %, = Vm+Vet = eee (26 yin A. 


v 4meqm 
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As discussed in the Introduction, the electron-electron collision term is 
obtained from the Fokker—Planck collision term. It can be shown (Bachynski 
et al.) that 








~ 6 0 1 VY 
[) (2) 7 a5 (fo YIv)+—— vate —_ (I: +J° | 
f d + 0 
(5b) (@) 22 ax ae aa (313—+2I%) 
a 
+33 f,;(—32o+J2—2J_1)+87Vfifo 
1 Yd 
+55 +h (Th+J22) +753 F8 (—31+20!2+51)), 
where 
(5c) Y = 4x(e*/4 eum)’ In A, 
4) I; = (4/v’) J fi! dv 
0 
(5e) Ji = (4n/v’) Ss vt? dv. 


Equation (5d) can be derived as well by extending the analysis of Hwa (1958) 
to arbitrary fo and putting the result in vector form. 

If fo is a Maxwellian distribution, the J° and J® integrals in eqs. (5d) and 
(5e) can be deduced in terms of the error integral @ and its derivative ®’, 
where 


2a wil 
@(W'”) = (2/7) J e’dy, ®(W) = (2/Vare", W = mv"/2kT. 


Table I illustrates this point. 


TABLE I 


_Re duc tion n of integrals for a Maxwellian distribution 


Staats 
General distribution fo fo = n(m/2rkT)32 exp( —mv?/2kT) 


19 n(&— Wr’) 
J% nW?)/2@' 
Ig —nW26' +(3n/2W)(&— Wo") 
Je nW1!2p'(1 + W-1) 
12430, (3n/2W)(®— W120’) 
dI9/dv = Anfov? 2WI*,/v = 2nW3'/y 








Equations (3a, b,c), (4a, 6, c), and (5a, b) are now combined, leaving fo 
arbitrary, to yield 


(6a) em Altar = a | o'(aratte} Re(a’-f)) | 


30° dv 





1 A) u 1 afo mv Y 
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(6b) vV.fota’ (dfo/dv) +o» Xfi = —v_(v)fi+ (5f1/6t) ce 
(6c) jof ita’ (dfo/dv)+oy Xfi = —v_(v)fi+ (6f1/5t) cc, 
where the electron-electron collision terms can be obtained from eq. (5d) 


with appropriate superscripts on f,. 
The d-c. plus a-c. electron current is 


(7) J = J9+J'e'. 

The a-c. conductivity {6} is given by 

(7a) J’ = {6} -E’ = (40/3) f fiv’ dv. 
0 


The d-c. current arises from the d-c. electric field, from density gradients, 
and energy gradients. Thus, 


ll 


(7b) J° = {6°} -E°—e{D} -v,n— (2/3k) {2} °V,u 


(4ne/3) { “f° ao, 
: 0 


where {6°} is the d-c. conductivity matrix, 

{D} is the diffusion current matrix, 

{e} is the current flow matrix due to thermal gradients at constant 
electron density,* 

k is Boltzmann’s constant, 

u is the energy per particle and is given by 


—- co 2 0 
o mv of mv \ fo 
(7c) emer m . 4nv =) a dv. 


When Coulomb collisions are negligible, as in a feebly ionized gas, the 
solution of eq. (6c) for f! in terms of f} is 


(8) f= Mia 
where 

imtjw ay 0 
@) M)--—ara| * te 


Similarly, f} is given by eq. (8), if (M°) replaces (M!) and vV,f}+a°df?/dv 
replaces a'df}/dv. (M°) is given by (M') with w = 0. 

In this limit, the transport coefficients can easily be evaluated from eqs. 
(7a) and (76). Writing the a-c. or d-c. (with w = 0) conductivity in matrix 
form one has: 


*Note that this definition differs from the common one in thermodynamics which refers to 
constant pressure. In discharges, it is more convenient to talk about electron density rather 
than electron pressure. 
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11 O12 0 
(9) (6) oy ak O22 0 
0 0 033. 
Then, for the feebly ionized gas, one obtains after integration by parts 
ne (1d v/3 
(9a) G33: = ne! (4 i, =» , 
j v dv Vm +jw a 


2 3 
; senile (3 av | eee deccadl 
m) one m \v dv 6 ackiwions make wp) JI7 a’ 


: 2 
‘ jne 1 dv ] 1 
(9c) — 0.2 = 023 = i ee eee : 
m \v dv 6 Lom+j (wtp) ~ im +j(o— Wp) 
where expressions either in parentheses with subscript ‘‘a’’ or overlined denote 


the following average for a scalar ¢ 


(9d) (?),=¢= (1, nf ofvdav’ dv with n= tn | fo" dv. 
To obtain the other transport coefficients, one applies the identity 


1) r pl O , 
Vio = (fo/n)Vn+nvV(fo/n). 
Also, since f’/n can be taken as a function of the magnitude of velocity or of 
0/ J 
v? and of a normalizing average such as u = mv?/2, one has 
Oy ap + 

nV(fo/n) = Ofo/duVu. 

Hence, 
Oy 


Vfo = (fo/ n)¥n+ (dfo/du) Vu. 


To illustrate, {D} and {+} for a feebly ionized gas are 


(10a) {D} = ae M°}o'fo do, 
(100) {2} = arck| {M°}v'(afo/du)dv = aah 21D} 
0 ~ 


(See comment after eq. (8a) for definition of {M®}.) 

If the electron collision frequency with neutrals is constant with velocity, 
then the averages in eqs. (9a—-d) can be simply evaluated. For example, the 
(33) matrix element of the conductivity is 


ne 
(11) 033 = — ae 
M(Vm+jw) 
It is convenient for many applications to define an averaged collision frequency 
even when v,, is a function of velocity, and apply eq. (11) with this averaged 
Ym. This is possible only for high and low angular frequencies compared with 
collision frequency. That is, for w > va, 
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Mo 3: 1 | 
ne (Ym) ete tJw 30° dv Mmtjo/ 
which yields after expansion 
d q 
(12a) (¥m)r-t = (Ym) = (Z, a v ma) 
Expanding in the opposite extreme of w X vm gives 
Ld 4) 
9 / urea ete 
(120) 1, (Um )a-< = a dv Vin 


Equations (7) express the current flow. The energy vector flow, H, can 


also be defined as follows 


(13) H = {yp} -E°—e{Q} «wn— (2/3k){K} «vu 
(2m /3) J fiv° dz, 


where {K} is the energy conductivity matrix at constant electron density,* 
{Q} is the energy diffusion matrix, 
{u} is the energy flow matrix due to electric field. For a slightly ionized 


gas, one finds that 


2 > 
(14a) fy} = — 222 “(Mo rdi as, 
(14d) {Q} = (2am, Ben) | {M"}fov° dv, 
VK} = 0) 6 fo 4, _ nek A1Qi 
(14c) {K} rmk | | {M} ro di oe 


The six quantities {é} or {6°}, {D}, fe}, {u}, {Q}, and {K} are the transport 
coefficients investigated in this report. 

The analysis becomes considerably more complicated if one has to include 
electron—electron collisions. (Electron-ion collisions present no difficulty; they 
can be incorporated by changing ym to vg in the equations for the slightly 
ionized gas.) Before presenting the complete analysis, it is worth while looking 
at the transport equations based on the above concepts. Three transport 
equations, corresponding to the particle, momentum, and energy conserva- 
tion laws are shown below. One may expect, from physical reasoning based 
on Newton’s third law, that the contribution of electron-electron collisions 
vanishes in the momentum and energy equations. The proof of this statement 
and the derivation of the transport equations are given in Bachynski et al. 


These relations are as follows 


*Note that this definition differs from the common one which refers to constant pressure and 
which is calculated from the peculiar velocity (vv). Note also that only the contribution due 
to electron motion is considered here. 
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on - 
5 —+V,.°¢ = 
(15a) a +V,env = 0, 


(15d) (0/dt) (nmv)+¥,enm{vv} —ne(E+vXB) = —nmy,V, 


=.) 2 (nme? nm 5 = nm, nmkT, (42 3 ) 
(15c) at 9 )4y, o i") neEev = M Vvgt Mu op? . 


where an average for a scalar quantity is defined in eq. (9d). An average for 
a vector A(v) is 


(16) A = (44/3n) J favtae 
For example, using eq. (7a) 
(16a) v= (4n/3n) | f,v'dv = J/ne = {6} -E/ne. 


The average for a dyad tensor {B(v)B(v)} is 


(17) {BB} = (4/3n) | foB*{Lo'dv+ (8x/1in) | {f.} Be dv 


where {f2} is the next order in the Boltzmann expansion and will be neglected 
and where {I} is a unit tensor. (There will be no off-diagonal dyadic terms 
(Johnston 1960) because we drop higher-order Legendre polynomial terms 
giving rise to {f2}.) The collision frequency vg in eqs. (15) is the sum of fre- 
quencies for electron-ion and electron-neutral particle encounters. Electron— 
electron encounters do not appear explicitly. They influence, however, the 
forms of f} and f;, which are necessary for the evaluation of the averages in 
the transport equations. 

The above is intended as a review of the fundamental equations. The 
following investigations (Section 3) provide values for the electrical con- 
ductivity for any degree of ionization. In Section 3, gradients in configuration 
space are neglected, so that the relations for f{ can be obtained from these 
for f} with w = 0 and no separate distinction is necessary. Finally, Section 4 
considers the effect of density and energy gradients on the analysis of Section 
3. The Appendix provides values for the transport coefficients. 


3. CONDUCTIVITY OF AN IONIZED GAS IN THE PRESENCE OF A 
d-c. MAGNETIC FIELD 

The following cases will be investigated in sequence: 

(a) Conductivity when Coulomb interactions are negligible, i.e. for a slightly 
ionized gas. 

(6) Conductivity including electron-ion effects but neglecting electron— 
electron effects. 

(c) Form of the conductivity matrix including all interactions, i.e. for any 
degree of ionization. 
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(d) Conductivity when Coulomb interactions predominate, i.e. for a highly 
ionized gas, under the assumption of a Maxwellian distribution. 

(e) Values for the conductivity for any degree of ionization under the 
assumption of a Maxwellian distribution. 

In all cases, only the electronic conductivity is considered; the conductivity 
due to ion motions and currents is assumed negligible. The temperature of the 
plasma is also assumed to be sufficiently low that thermal plasma oscillations 
are not important. No gradients are included. Furthermore, a steady state is 
assumed. 


(a) Conductivity for a Slightly Ionized Gas 
The results of Section 2 apply to this case. From eqs. (9) to (9c), it is 
obvious that the conductivity tensor (6) can be put into the form: 


b+c j(b-c) 0 





(18) lah so) be 8 
0 0 d 
where 
2 3 
(18a) 2c,2b m \v" dv 3 vm+j(won)/ a 
(minus sign for 5, plus sign for c) 

and 

ne’ (1 d v ) 
(18d) os m (t dv 3(¥m+jo)/ a 


soo 


where the expressions in parentheses with subscript ‘‘a’”’ signify an average 
over the electron distribution function f}, e.g. 


(18c) (a= = (1/n) J“ otayo%ae 


It is convenient for the subsequent analysis to recast these equations for 

b, c, and d into another form. Define new functions, g and h, such that 
2 
(19) eth ic Rn en 
M £4 .0(¥m)+jhs,o(wwpy) 

(minus sign signifies negative wy, plus sign positive w,, and 0 signifies w, = 0) 
where, for convenience, (vm) is chosen to refer to the high radio-frequency 
equivalent collision frequency (w > w)+vm) derived in eq. (12a), viz. 


1d vr) de fx ‘ 
¢ = {-=—-——) = —-— — y Vp dv. 
(192) (ma) (3 SS) a I oy? tu 


In the case of a weakly ionized plasma, the parameters g and h, defined 
in eq. (19), are functions of |(w-twp)/(vm)| only. That is, one restricts oneself 
to either weak or strong electric fields where the velocity dependence in f? 


can be separated from the electric field, and to the elastic range where f} 
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does not depend on the excitation potential. Effectively, one assumes that 
fo can be written as a product of constants, parameters, and a separable 
velocity dependence, normalized with an appropriate velocity average. Also 
Ym is assumed to be given as a product of a velocity dependence and other 
factors. 

For constant collision frequency with velocity, g = h = 1. Thus, these 
functions are reduction factors for (ym) and w, due to the variation of ym 
with velocity. 

By comparing eqs. (19) and eqs. (18a, 8, c), the following relations can be 
obtained for g and h, as functions of (w-+wp)/ (vm) for a given velocity variation 


of ym. 


(4 : a 
(20) g ee eee ee v dv 3[¥m + (w+) | aS ote ——, 


(42 tra oe he oie ) (: d v" ) 
yv dv 3 /,\v' dv 3[¢mtj(wtwp)|/ a v dv 3[%m —j(w+wy)]/ 
a =) Ofo _ __v'vmdv / (Ym) 2 


4a dv a == D A 

| [gee (see) 

(20a) =] af , 
fs v'Ymdv/ (Ym) ; of [ (ww ) / (vm) ]v'dv 








: + 
ta) Fe 
R | 2 )° = (2# ‘s + 


(! : Basa 
(206) Na do 8 vm + (Wo) "Vn a 








: ee is. v ) 
v” dv 3[%m+j (wwp -) (4 dv 3[¥m—j (way) |/ 9 


, 3 
3n fo ssw dv 


Ae 


Ofo 0 Mmdv/{rm) Afo [wo wy) / Ym) |o'dv 


av < (2 a)" av 2 ( )'] 
i Fees Wm) (im)? 


It is seen from eqs. (20a) and (20c) that g and # are real numbers as a con- 
sequence of the fact that (A+ 7B) (A —7B) = (A)?+(B)*. They are also 
positive in value. 

For a Maxwellian distribution and py, « v~%,v-*,...v3 the integrals in 
eqs. (20a) and (20c) reduce to functions, which are tabulated by Dingle 
(1957). These results are presented in Subsection (e). 

Note that g and h reduce to 1 for |(w-w)/(vm)| >> 1 so that g(o) = h() 
= 1. This is due to our choice in defining (v,) according to eq. (19a). For the 
opposite extreme w+, = 0, one has 
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1d v" v lduv 
9 = 
(20d) g(0) 1/ (43 dv 3 a) (42 dy e) 


and 


ldv AG dv Ey 
9 eas Ss... eo 
(206) h(0) (3 dv 3¥%m/ v° dv 3¥m/ a 


(b) Conductivity When Electron—Ion Effects Are Included 
If one includes electron-ion interactions but neglects electron—electron inter- 
actions, one can still define an electron collision frequency with neutrals plus 
ions, given by eq. (4e), viz. 
Ze” 
21 Vv) = Ym(v) mete yin a 
(21) re(0) = ra (0) +785 (Ae 


where the parameters have their usual notation. One can also derive a value 
of (»,) by averaging according to eq. (19a). 


(¥_) - (Ym) + (Ye) 


where 


; a Ze’ a fz 
9 Aes oe Cjo 5. 
(28) 3. n \Arecm in A 0 Ov ie 


- ee =. yin Afi(v = 0). 


3 nn \4reqm 


For example, for a Maxwellian distribution, f}(v = 0) = n(m/2mrkT)*/? and 


? 4(2 ae) n.( Ze* ) ey" 
») — aone 
(23a) (va) = 3 twekT In A, 


Equation (23a) is very common in the literature and is used as an equivalent 
electron-ion collision frequency. Note also that one can add effective collision 
frequencies according to eq. (22), only because the high radio-frequency 
average is chosen for defining (vg). 

From the discussion in Sections 1 and 2, it follows that the analysis of the 
previous subsection still applies with v, assuming the role of ym. One can 
define functions g and h by eqs. (19) to (20e). However, g and h now depend 
on (¥m)/{ve) as well as on |(w+wp,)/ (vg)! for any given variation of », with 
velocity. 

The same analysis can be applied to a mixture of neutral particles. Let 
(vg) be the sum of the collision frequencies of each species including ions, and 
obtained by adding averages according to eq. (22). For a mixture of 7 neutral 
molecules, one can derive functions g and h which will depend on the 7 ratios 


of (vg)/ (vm). 


(c) Form of the Conductivity Matrix Including All Interactions 

The object here is to prove that the conductivity matrix has the form of 
eq. (18) even if electron-electron collisions are included. Although the co- 
efficients 6, c, d are not given by eqs. (18a, 6), they can still be written in the 
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form of eq. (19) with g and / as functions of three parameters, |(w-twp)/(¥m)|, 
(vm)/{vei), and Z, for any given variation of ym with velocity. Of course, the 
values of g and h will differ from eqs. (20) to (20e). At best, they can be 
tabulated or plotted versus their functional parameters. The usefulness of 
such plots is apparent and justifies their derivation. 

The analysis follows from eqs. (6c) and (56) for f;, namely, 


df 
dv~ 
df, 


ES 0. gO. 70 5 ee 
+533 (3I—N+2I%x) F 


(24) jofita df 5 to X fi = ~ refit 5 (ate 


tae (—3+1—-2F°)f:+82Vfef: 


et 313+231.+5I)), 


= (4r/v’) | fyw'**dv, Ji = (40/v’) J fv? "dv 
0 v 


a = cE/m, @, = eB/m, and Y = 4n(e’/4meom)' In A. 


The above vector equation can be decomposed into three scalar component 
equations, which are as follows when B is along the z axis or w, along the 
minus 2 axis: 


(25) jof iz t+adfo/dv+ wp fiy = —vgfizt+[O] fiz, 
(25a) jof y+a,dfo/dv—ufiz = —rvefytlOlfu, 
(250) jof etadfo/dv = —vefizt[O] fiz, 


where w, = |eB/m| and [O] is an integrodifferential operator which can be 
deduced from eq. (24). It is a consequence of the fact that in a given com- 
ponent equation, [O] operates only on this component of f;, that the final 
results can be deduced. 

Define two new complex distribution functions, namely, 


(26) fo = fiztify and fo = fiz—Ifry. 

Also define complex electric fields 

(26a) a, = a,+ja,, Qy = A;—jay,. 

Then addition and subtraction of eqs. (25) and (25a) yields 

(27) {j(w— op) +¥e—[O]}f. = —ardfo/do, 

(27a) (7(wt+oy)+re—[O]} fo = —Awdfo/do, 
{jotve—[O]} fiz = —adfo/do. 
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The reason for introducing f, and f, is now apparent, since one now has 
separable equations in f, and f,, instead of mixed equations in f1, and f,,. Note 
that the solution of either f, or f,, (even for w = 0 but w, ¥ 0) is sufficient to 
provide solutions for all three distribution functions f,, fy», and f1;. One simply 
understands by w, the values for w-+twp, and w respectively. 

Consider now the conductivity relation, eq. (7a), 


(28) (4c*/3m) | f,v'dv = {6}-a. 
0 


If oi; are the elements of the conductivity matrix (6), it is obvious upon 
comparing the z components of eq. (28) with those of (276) that o3; = o32 = 0. 
If the other two components of eq. (28) are combined into complex form 
according to eqs. (26) and (26a), and if they are compared with eqs. (27) 
and (27a), one finds that o13 = o23 = 0, o1; = o22, and o12 = —o2. Hence, 
eq. (28) yields 


(4 re" /3m) J f°, 


(28a) (o11+jo21)a, = 
(28d) (011—jo21)dy = (4re*/3m) | feo%de, 
(28c) o320, = (4 re’/3m) | frs%do. 


Equations (28a) and (288) can be solved for 01; and o2;. Thus, 


(29) 61 = 62 = ae Ane. Jf (auf otafy)v'dv = b+¢, say, 
(29a) 02) = —On = at Ane. f (Auf ,>—Arfy)v dv = —j(b—c), say, 
where 

(29b) 2b = (4me"/3ma,) J f0'ae, 

(29c) 26 = (4ne"/3mag) | foe. 


Furthermore, for the 2 component 


29d) a= 633 = (4 re” /3ma,) fi'dv. 
(29d) 


Since fi, = f, or fo when w, = 0, therefore d = 26 or 2c when w = 0. Fur- 
thermore, the statement that eq. (18) is the form for o has been verified, and 
eqs. (296) to (29d) give expressions for 2b, 2c, and d. Note the similarity in 
forms of the general eq. (28) and the complex component eqs. (298, c, d). 
Consider finally the momentum transport equation, eq. (155), for the 


situation of no gradients and of a steady state. 
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(30) jonmvy —ne(E+vXB) = —nm »,V = — (4xm/3) { “twee. 
0 


Since nev = {6} *E, one can readily show from eqs. (30), (26), and (26a) that 


(30a) a,[j2b(w—wy) — (ne”/m)] = —(4ne"/3m) J forgo’, 
(30d) Ay[j2c(w+-wp) — (ne”/m)] = — (4re"/3m) J fergo%ae, 


(30c) a,[jdw— (ne”/m)] = — (4re/3m) J fiavgv dv. 
Eliminating a, and a, by substituting from (296) and (29c) yields 


(31) 2b _ ne jm 


; j ta on If vef v dv fs Sistae| 


ne’ /m 


i yess in ice am Deepen 
j(oton)+| frefer'd / Sree | 


27 


(31d) $2 ye 

jut} f Veh rd" 
In each equation, the quantity in the square brackets in the denominator is 
complex so that it contributes to the imaginary frequency term. This is 
equivalent to saying that there exists a function (denoted here by h) multi- 
plying (ww), which differs from one in general. 

The above results are quite general. If we adopt the assumptions discussed 
in (a) that the velocity dependences in fy and vm are separable, then it can 
be seen from eqs. (27) to (278) that f,, fo, and f1, can be solved in principle 
as functions of v, (w-kwp)/ (vei), (¥m)/(va), and 1/Z. Integration over velocity 
in eqs. (31) to (318) eliminates the velocity dependence. Thus, one can write 
eqs. (31) to (318) in the form 


ne’/m 
32 t2b4 = 
~ =e (Yq) £+j (wap )h 
(negative w, for 6, positive for c, and w, = 0 for d), 
where g and h are functions of three parameters, which can be conveniently 
chosen to be 


fg 
(33) Vj f [I osr) /re)|, Pm)/ Wer), Z] 
for any given variation of vm with velocity. If one neglects electron-electron 


interactions, the Z dependence does not exist (Subsection (6)) and if one neglects 
electron-ion effects, the (vm)/(ve) dependence is also eliminated (Subsection 
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(a)). For a completely ionized gas, only the (vm)/ (ve) dependence can be neg- 
lected. In general, for any degree of ionization, all the parameters should be 
considered. 


(d) Conductivity for a Highly Ionized Gas (Maxwellian Distribution) 

In a highly ionized gas, the frequency of electron collisions with neutrals 
is negligible and only electron collisions with ions and other electrons need 
be considered. 

Spitzer and Harm in their classic paper (1953) solve for the distribution 
function f; and for the conductivity under the restrictions of w = w, = 0 and 
fo, a Maxwellian distribution. Effectively, a solution is found for eq. (24) 
with w = w = 0 and » = vy = ny Yq/v*. The results depend on the ionic 
charge number Z. 

Figure 1 shows schematic curves for dJ/dv = (4mre/3)f,v’, the contribution 


0 
0 4 a 5 2:0 SS 30 
vims2kT)!/2 


Fic. 1. The distribution function f; multiplied by (47ev3/3) to give the current per electron 
velocity dv for three cases of interest: (a) transverse to a strong magnetic field, Z < ~, 
B= o;(b) Z=1, B =0; (c) Z = ~, B =0, a Lorentz gas. 


to the electric current per electron velocity interval dv, at velocity v. Curves 
(6) and (c) correspond to an ionized gas, in the absence of magnetic field under 
d-c. conditions, with Z = 1 and Z = © respectively. Curve (a) is for the 
case of a strong magnetic field. The distribution (c) with Z = © is provided 
by the so-called Lorentz gas, a hypothetical fully ionized gas in which electrons 
do not interact with each other, and all the positive ions are at rest. Since 
electron-electron collisions are neglected, f; is related to fy by eq. (8), which, 
for a Maxwellian distribution, o, = w+=0 and vy « v-%, reduces to 
fi = Av‘ exp(—mv?/2kT) where A is a constant. Equation (31) for the con- 
ductivity becomes with w = w, = 0 and 


N+ Pi ni4 T ( 
6 eS ne 
= v Vv 


* Sfw" dy ne” * Sfw* dv 


= foafodv mnyYa ffridv 


_2mn_ (exe 
Zn,inA\ e ™m i 
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where subscript ‘L’ refers to a Lorentzian gas. This equation is valid for large 
Z values, that is, for a multiply ionized gas. It is also valid when the ion 
density is much greater than that of the electrons, for example, when the 
plasma has a high negative ion density. It will also be shown later to be 
valid in a feebly ionized gas if yy « v~*. 

If one substitutes (v..) from eq. (23a) into eq. (34) one obtains 


2 
ne 32 

34 = — i ) 
asi - m \3m(ve) 
It follows that the function g for this case (w = w = (ym) = 0 and Z = ~) 
is given by g(0,0,©) = 37/32 = 0.2945, according to eqs. (33) and (34a). 

For other values of Z, Spitzer and Harm obtain results, which they express 
in the form 
(35) OC; = Vyeu 
where yx depends on Z. Values for yg and the corresponding g function are 
given in Table Ila for various values of Z. 


TABLE Ila 


6 © 








YE .582 - 68¢ 923 1.000 
2(0,0,2)  .506 431% 3752 3191 . 2945 





For the important case, Z = 1, one has 


m (4x \7/2kT\*" 591 (RT)' fey 
. a Seer eee . RRO) ee 
(36) vt in a e€ ) eS (1.1632) 1 e 


ioisi(#2) —— f — (1.53 10~*) mho /meter. 
é In A 


The numerical expression for A in (36) is obtained from eq. (4d) with Z = 1. 
(36a) Av= 1:24>¢101(7*/n) 2, (m in 1/meter®). 

In the presence of a strong magnetic field, Spitzer (1952) finds that the 
conductivity transverse to the magnetic field is given by eq. (35) with 
Yep = 37/32 = 0.2945. We note that g(~,0,1) = 1 for this case. By the 
definitions of g and (vq), this must be the limiting value of g as |w-+tw,|— ~. 
Figure (la) in the graph for d//dv corresponds to this case. 

For Z = 1 in d-c. electric with no magnetic field, it is convenient to write 
(366) ao, = ne?/m (Vion) 


where by comparison with eq. (36), and using the g(0,0,1) value 


(36c) 1/ (rion) = 1/[g(0,0,1) (va)] = 1.9747 / (ver) 


a 1.1632m°* ee ey" 
~ min A e wi 


_ 1.9747 6/2 €0° (akT)?m? 
ne‘In A . 
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It is seen from the above theory that, for a completely ionized gas, the 
electrical conductivity is approximately proportional to the 3/2 power of 
temperature, and it changes only very slowly with ion concentration density 
through the logarithmic term. Moreover, it is independent of the chemical 
nature of the gas. 

The above expressions derived from the work of Spitzer and Harm give 
only the d-c. conductivity of completely ionized gases. They cannot be directly 
applied under a-c. conditions. Landshoff (1949), however, obtained solutions 
for the transverse d-c. conductivity in the presence of an arbitrary magnetic 
field. From the discussion in Subsection (c), his analysis is directly applicable 
to the r-f. case, if w, is replaced by (w-tw,). These results are derived in the 
following Subsection (e). For convenience, they are tabulated below in Table 
IIb in terms of the g and A functions for Z = 1. For zero frequency, there is 


TABLE IIb 


watwn e( ene 0,1) a( ohew 0,1) 
(vei) (vei) (vei) 


0 0.5128 . 207 
0.02 0.5129 . 207 
0.05 0.5134 . 206 
0.1 0.5149 . 203 
0.2 0.5203 . 1943 
0.5 0.5431 . 1601 
1 0.5781 . 1270 
2 0.6516 . 0962 
5 0.8375 .0434 
10 0.9440 .0149 
0.9845 .0041 
50 0.9974 .0007 
ed 1.0 0 


a 


only a slight difference between these results and that of Spitzer (0.506 
instead of 0.513), due to the approximate expansion method of Landshoff 
(see Subsection (e), where results are plotted, based on a higher order ex- 
pansion). 

The reader is referred to the papers by Dougal and Goldstein (1958) and 
Anderson and Goldstein (1955), who summarize differences between various 
theories under various conditions, such as d-c. or a-c. They also present 
important and interesting experimental results and comparison with theory. 

To include the effects of neutrals, one either computes numerical solutions 
of the integrodifferential equation (Hwa 1958)* for f,, eq. (24), or adopts an 
approximate method of solution, such as the expansion in generalized Laguerre 
polynomials, used by Landshoff (1949). The expansion method is presented 
in the next subsection. 


(e) Conductivity for Any Degree of Ionization (Maxwellian Distribution) 
Landshoff (1949) expands the collision term in Boltzmann’s equation directly 

in terms of generalized Laguerre polynomials (or Sonine polynomials), without 

applying the Fokker—Planck equation. The method is based on the general 


*It may also be possible to apply a variational principle. (See I, B. Bernstein and B. Robin- 
son, Report MATT-9, June 1959, Project Matterhorn, Princeton U., N.J.) 
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theory of Chapman and Cowling (1958). This formulation is followed by 
Grad (1949) and also by Kelly (1960), who includes electron collisions with 
neutrals, ions, and other electrons. The alternative approach, used in this 
report up to now, is based on the Fokker—Planck equation, which has led us 
to eq. (24). It will be shown that if the Laguerre expansion is substituted 
into this equation, then these results and Landshoff’s are identical. In all 
this analysis, it is assumed that the zero order distribution is Maxwellian; 
this is correct providing the disturbing forces, much as the electric fields, are 
small. It is also assumed that only elastic collisions are important. Dreicer 
(1960) investigated the high energy range where inelastic collisions have to 
be considered. However, to calculate the new distribution function fo, he 
included electron-electron effects only in the fo equation, and not in the f; 
equation. 

In the method investigated in this subsection, one expands the velocity 
dependences in f, or f,, the combinations defined in eq. (26), as a series of 
generalized Laguerre polynomials (Erdelyi et al. 1953), thus 


(37) fu = fo(W) DD, pL;?(W) = W'"fo(W) Dd pL;(W)/6 


where 


(37a) 





W = mv?/2kT = Bv?, B= (m/2kT)*?, 


fo(W) = n(m/2xkT)¥2e-" = npr" /x*/2, 





and L?/? is the generalized Laguerre polynomial* of order 3/2. Henceforth, 
the superscript 3/2 is to be understood. These functions are chosen, since 
they converge well for small deviations from a Maxwellian distribution, and 
they represent an orthogonal set. The particular choice of order 3/2 is made 
because it simplifies the analysis and result for electron—electron interactions. 
The orthogonality relation is 


(38) | WL, (W)L,(W)dW = [['(r+5/2)/T(r+1)]6,,. 
“0 


For reference, the first three polynomials are ZL) = 1, L, = 5/2—W, 
L, = 35/8-—7W/2+ W?/2. 

In eq. (37), p, represent coefficients in the series and are independent of 
W. They will be determined from the solution of the integrodifferential 
equation for f, and f,. The coefficients for f, differ of course from those for f,. 

Choose a particular term in the general expansion aud denote it by 
(fu.r)r = Pr(vfol,). Substitute this term into the collision expression eq. (24). 
If ry is left arbitrary, the result can be applied to all r terms. After substitu- 
tion, one multiplies [(0f»,,/0t),,-/pr] by -"W%?L,(W)dW/[T(5/2)efo], where 
L, represents various Laguerre polynomials. On integration over W, the 
orthogonality condition, eq. (38), is sufficient to evaluate terms on the left- 
hand side of Boltzmann’s equation. The electron-electron interaction terms 
on the right-hand side do not behave as well. This is a consequence of the 


*The generalized Laguerre polynomials are closely related to the Sonine polynomials, used 
by Chapman and Cowling (1958). 
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awkward nature of like-particle interactions. If these are ignored, one can 
obtain the answer directly, since the collision term becomes simply eq. (46) 
with », given by eq. (4e) replacing yp. 

The interaction terms on the right-hand side of Boltzmann’s equation can 
be obtained by employing the generating function. Instead of trying to 
evaluate an integral involving Laguerre polynomials and an awkward func- 
tion, one multiplies Z, by ¢” and one sums over r. Consider the generating 
function for Laguerre polynomials, namely (Erdelyi et al. 1953), 


(39) (1—¢)-"exp[-—¢W/(1-)] = YS gL, (W). 
As a result 


(39a) DU DU sn LL, = (ln)? (1—¢) exp{ — Win/ (Ln) +6/ (1-5). 
The integrals for which this method is required are of three forms, namely 
(in the following, B = «-"W*/?/T(5/2), C...F represent functions of W, 
and G, H represent functions of ¢ and 7 only), 


®co w 
J B(W)L,(W) (f CUW)LAWyaw)aw = H°. say, 
0 


(these are the really awkward integrals) 
J B(W)L,(W)D(W)L,(W)dW = Hf), say, 
0 


(3) 


J “B(W)L,(W)[(a/aW)(E(W)L(W))dW = H®, say. 


To solve for the H,, coefficients, one multiplies through by ¢'n* and one sums 
over r and s as follows 


2, tet - f° (x n'L.(W)) FW.) 
where 

Ww 

FO (Wg) = | cw eL,(W) Jaw 
° <a 

Gg-ty" | C(W) exp[—We/(1—-4)]dW, 
(2) ret , et a —5/2 7 ayy Se oe 
F®(W,o) = D(W) DS gL. (W) = (1-9) D(W) exp[— Wt /(1—$)], 
F® (W,t) = saw] BO) LW) | 


= (1-5) °"0/aW[E(W) exp[— We/(1—$)]], 


and use is made of eq. (39). Similarly one can evaluate 
E vn'H!? = 1-2)? f“BW) expl-aW/-a) IF Wa 
0 


= G'(nt) = D> n'y, 
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where G‘(n,¢) is also expanded in a double Taylor series with coefficients 
H‘®, Obviously, since the power series is unique, these are the coefficients 
originally sought. They are obtained by using the generating functions, to 
yield integrals which can readily be evaluated, instead of the original more 
difficult integrals. One can tabulate the values of the coefficients for all 
values of 7 and s, by putting them in matrix form, with r representing rows 
and s columns. These matrices will be presented later. 

Upon substitution of the velocity dependence (fw,.);/pr = ufoL, into the 
collision expression, the following points should be noted. One can first multiply 
by ¢’ and sum, and later, after L, is inserted, perform the same operation 
with 7°. All the necessary operations are of the form given by the functions 
F((W,¢), after multiplication by ¢’ and summation over r. The function 
C, D, and E contain the contribution vf , which combines with the exponential 
to yield a velocity-squared dependence of W'exp{—W[1+¢/(1—$)]} or 
W'exp[—W/(1—£)]. Hence, it is seen that the simplest procedure, rather 
than dealing with vfoL, directly, is to evaluate the results after summation 
over ¢ on the rth term of eq. (24) using the much simpler function W'/*exp 
{[—W/(1-—)]. That is, work with 


(40) QO &"(fo.)r/Pr = W'"'fo dy §'L,/B 
= (nB°/n*”)(1—t) °°W"exp[—W/(1—-5)]. 


Consider the collision terms (all but the first) in eq. (24) which describe 
electron-electron interactions. When necessary, differentiate eq. (40), use 
Table I for the 7° and J® integrals, and evaluate the J' and J' integrals, defined 
by eqs. (5d), (5e), by integration by parts and with a term incorporating the 
error function. For example, for the J' and J' terms, we consider 


se pe Mist Joa) e = (1 “14 - sing exo] =] (1-¢) 11 itt 2] 


Pr 
1in(1=2)"? ( W )"} 
+ 4gw*” 1 —¢ f 
and 


wr (—B15+2I*2 +511), _ so J _45n_(1—5)° =m 
= $ Pr ar {dg} \2n'78 W ex p l-¢ 


eel te 8 | ( Sg: 
+28 WL" ow SLM) 


where B = (m/2kT)'/2 and where 








- "2 4 
(x) = (2/n”) 3 e dy 
is the error function. 
The electron—electron contribution to the rth part of the collision term, 
namely (0fee/dt);,-, can be grouped, after summing over 7, into three terms, 
with respective multipliers fy exp[—W/(1—¢)], ®(W'/*), and ®[W'/?/ (1—¢)"/?], 


as follows: 








SHKAROFSKY: TRANSPORT COEFFICIENTS 


(41) = = 1 (as) : = y(i-¢)*” 1th foexp( = ) 


ae to =2 [1 1-2-30-1)) | 


si —2(1-9) +201 — (1—9)] 
+26, 3 : 8(W 5 exp( = Wy] wW(1—¢)° | 


1/ 5/2 i 
neta] (JH) ] O50" aa—n—nf 


where m is the electron density and Y = 4x(e?/4aeqm)? In A. 

The electron-heavy particle collision contribution, (0f.¢/dt);,-, can simi- 
larly be expanded. The result, obtained from eq. (37) and from the first term 
on the right-hand side of eq. (24), is 


he re (x) = —(W'"/B)fore(W) Do s’L,(W). 


The left-hand side of Boltzmann’s equation, given by eqs. (25) and (25a), 
is 

(43) —eE/kT+j (wep) > p,L(W) = (af/at)-(8/W""fo). 

No summation involving ¢’ is necessary on the left-hand side of eq. (43); 
only the orthogonality property is required, since the terms are simple. 

In this procedure, eqs. (41) and (42) are also divided by (W'/*fo/8), and 
the three equations, (41) to (43), are multiplied by W*/*e-"L,(W)dW/T (5/2) 
and integrated over W. For eq. (43), the orthogonality relation, eq. (38) is 
used, but for the other two, the generating function, eq. (39), is reapplied 
with, say, 7° Pe ¢’. As a result, one obtains 


(wo) (s+5/2) 
44 ———~—_—_—_ »,b,, = Hi? +H*)+H%s 
(44) +IFG/2 rd) Pe = — Le bAe +H) +H 
where 6,,; represents a Kronecker delta, and the H,, coefficients are evaluated 
in the following for electron—molecule (em), electron-ion (ei), and electron- 
electron (ee) interactions. 


Jy{em 


‘m, the contribution due to neutrals, is obtained by expanding and identi- 
fying terms in the relationship 
- s cn Wee ow Lt) (s/n) , 
45 ¢'n‘Hs; = — j alarm rreecgaeno dW. 
5) 2 en JX 2X pew %he/ayp, he 


Hence, after substitution from eqs. (42) and (39a) 


(45a) >> a oH 


a—)-*"(1 aa 2 J fo ee 
i et nm (W)W°exp) —W| 1+----+-— |( dW. 

(5/2) mal WW ep) tetas 
In the following, it is assumed that vy, varies as some power of velocity. This 
is not a restriction since for any other variation, eq. (45a) can be used directly. 
One writes », in the form 
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(46) 'a = I (5/2) (Ym) W"-*/2/T (m-+1). 


That is, y, « v?"-*, Note that m = 0 can be applied as well to electron-ion 
collisions. Also the constant in eq. (46) is so chosen that eq. (19a) holds for 
a Maxwellian distribution, namely 


(46a) (mm) = (se (')).. 
Integration of eq. (45a), with », given by eq. (46), yields 


(47) D on He = vm) (1-g)" (19) (tn). 


To obtain H*™, one expands the right-hand side of eq. (47) in Taylor or 
binomial power series and one seeks the coefficient of each ¢'n* term. The 
coefficients can be put in matrix form with 7 denoting rows and s columns. 
The results are given explicitly up to r = s = 2 and yields a 3 by 3 matrix, 
namely 


+ (m-+1) 


(m+1)(m+2) 


+(3-m) em) 


The dots indicate that the matrix can be expanded to as many terms as 
desired. Note that if v, does not vary with velocity, then m = 3/2 and the 
matrix is diagonal. 

H*, the contribution due to ions, can be found from (48) by writing (vq) 
instead of (vm), where (vq) is given by eq. (23a), and letting m = 0 (vy ~ 1/0). 


1 3/2 15/8 
et 3/2 13/4 69/16 
(H7.)" = (va) 15/8 69/16 433/64 
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Finally, He, the contribution from electron-electron collisions has to be 
evaluated. This is the most tedious calculation. It is obtained from eq. (41), 
by an operation similar to that for electron collisions with neutrals, given 
in eq. (45). To perform the resulting integrations, the following equations 


are useful (Erdelyi et al. 1954) 







(50) J eww = roe ; 
x 4 —sWw 1/2 yn ze ee ad Sa l 
(50a) J a VeW' WW = (-1)"Faap 





(Actually integrals of the form of eq. (50a) occur only with m = Oor 1. This 
is a consequence of the choice of Laguerre polynomials of order 3/2.) Define 
for electron-electron effects, an averaged collision frequency 


s (van sae . e_.) (AE) 4Y¥ng° 
woe 9 4nehT calennlt i 


After integration, the results in terms of (v) are in the order written in eq. (41) 
(52) —U f'n @nr—)™” =p? (1—s)’ 


) mi (2 +(-1)*-30-9') | 
+=) =a) @= t= 9) MIF 


+ = 2(v) 5+ 2(v)s (1—1) 373 
(1—§)?(1—n) (1— fn) (2—f— 0)” : 













(1—$)(1—$9)*(2—¢—9)” 

OS a. 14 (v)(2—35) 
(1—s)?(1—gn) (2-50) °F" (10) (2-g 0)?" 
To combine the 10 terms algebraically, it is simplest to add consecutively 


terms 7, 4, and 8, terms 1 and 9, and terms 10, 5, 6, 2, and 3, and then the 
three groups consecutively. The final result is 


8+45+4n+fn— 26°n—2¢n-° +3¢%n'] 
(1—f)*(2-¢—9)” 


Expanding eq. (52a) up to r = s = 2, yields 
g J 





a 










(520) — DD rata = lo 


















0 0 0 
0 1 3/4 
0 3/4 45/16 





(53) HSS = (v)2"% 






Denote 
(54) Hs = HS? +H3,+H*.. 


The elements of H,, therefore consist of the sums of those in eqs. (48), (49), 
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and (53). One can write down the sets of equations derivable from eq. (44) 
fors = 0,1,2,... . Up tor = s = 2, they are explicitly* 


(55) j(wtwp) potpolootpibiotp2He20 bie eE/kT, 
(55a) JO (wtp) p1/2+po.HutpiAiitpeHa... = 0, 
(55d) 735 (wtp) p2/8+poHo2et+piiet peer re _ 0, 


In matrix form, eqs. (55) to (550) are 
(55c) 


1 j(wtoy) +Hoo Ho Fo2 eis w Po eE/(v)kT 
(vy) Ho 5j (wewp)/2+Ai1 Hy . Pr 0 
we Hoe Ay. 35j (wun) /S+Hos . . eee 0 


Let (A) represent the matrix on the left-hand side. Equations (55) to (55d) 
or (55c) have to be solved for the coefficients po, pi, pe... . By multiplying 
each side of eq. (55c) by the inverse matrix (A~'), one finds 


Po eE/(v)kT 
(55d) pi \ = (4°) 0 
pe 0 


If |Aoo| is the (00) minor of |A|, the determinant of (A), then the solution 
for the coefficient po is 
/ 1 |Aoo| eE 
(56 vel oe 
pe diel v) |A| kT 
Similarly p; and ~2 can be found, and substitution into eq. (37) yields fy... 
As far as electromagnetic properties (conductivity) are concerned, only po 
is of importance. To show this, note that 


(57) ee NV,» = (47e/3) | fe..0%dv 


(47e/3) J fo" = p>L5(W)dv = nekT po/m 


where the orthogonality relationship has been used. As a result, one obtains 
for the conductivity (cf. eqs. (28a) to (29a)), 


(58) 2¢,2b = (oF jon) = Jw,r/Ew,» = (ne?/m(v)) (|Aoo|/|A]) 


where the + sign in eq. (58) has to be taken in conjunction with 2c,2b and 
with the + sign in eq. (55). For w) = 0, 033 = d can be obtained. 

*In this analysis temperature and density gradients are neglected. Landshoff shows that 
including them would require writing (5/2)(V7/T) instead of zero on the right-hand side of 


eq. (55a) and adding (—Vn/n—-VT/T) to the right-hand side of eq. (55). This is investigated 
in Section 4. 
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Equation (58) is the basic result of this analysis. It shows how the con- 
ductivity can be found by adding the individual contributions of radio fre- 
quency, magnetic field, electron-electron, neutral and ion effects in matrix 
form. This equation, except for the r-f. and electron—neutral particle contri- 
butions, has been obtained by Landshoff and his matrix terms agree with 
those calculated here from the Fokker—Planck equation. Thus, the equiva- 
lence of the Boltzmann expansion approach, used by Landshoff, and the 
method of expanding the Fokker—Planck equation, adopted here, has been 
demonstrated. 

Note that, from eqs. (58) and (32), the g and # functions are given by 


= Re(/al/dul) 4, — Im({AL/|Aee) 


(59) B/W) (ween) Kv)” 


Equation (53) for H® possesses the interesting property that there are 
zeros in the first row and first column of the matrix. The following reasoning 
shows that this has to be the case. First note that the form of the H® matrix 
is independent of the vg, variation with velocity, and any particular variation 
can be chosen without invalidating the argument. When »,, equal to rym+vei, 
is constant with respect to velocity, the g and h functions are identically 1.* 
Since g = h = 1, eq. (59) requires that 


(60) Re(|A]/|Aoo]) = (vg)/(v) and Im(|A]/|Aoo]) = (wstap)/(v). 


This is only possible if Ho, = H,o = 0 in the A matrix (see eq. (55c)) and 
Hoo = (vg). Now, the sum of the matrices H*™ and H* is indeed diagonal 
for constant yg, since it can be expressed by eq. (48) with (v,) replacing (vm) 
and m = 3/2. Also, the (w+w,) contribution is diagonal. The obvious con- 
clusion is that He = 0 for r = 0 or s = 0. Since H® is independent of yg, 
the previous statement is also true when v, varies with velocity. 

Another property of the H® matrix is that the r = s = 1 element is 
(v)V/2. The significance of this will be discussed in Section 4, eq. (96c). It 
can also be shown that the elements in the first two rows and columns of the 
He matrix would result from any expansion in which the first two terms are 


(61) fo. = W*fo(W) [pod +p1(B+W)]/8 


where A and B are constants and fo is a Maxwellian distribution (see eq. (37)). 

One notes that if ym varies as v~*, the He™ and H*, contributions, given in 
eqs. (48) and (49), can be combined, and the result can be expressed by eq. 
(49) with (»,) replacing (vm). Otherwise, one can define an equivalent charge 
number 


(62) Le - (v,)/{v) = ((ver) + (¥m))/ (v) = (1+(¥m)/ (ve) (Z?n,, n) 


and then 


*This can be deduced from the transport eq. (155) which, for constant vg and no gradients, 
becomes 
(jwot+vg+(eB/m) X){6}.E = (jo+y,+(eB/m)X )nev = neE/m. 
The solution for {6} is then given by eqs. (18) and (19) with g = & = 1, 
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i 
(62a) H,, = Hyi+Zeall ss. 


Consequently, the Lorentz result for the conductivity, given in eq. (34) for 
a very large ion charge number in a completely ionized gas, also applies to 
finite charge number in a neutral dominated gas ({vm)/{va) = ©), provided 
Ym Varies as v~*, 

Numerical values were calculated from the above analysis up to r = s = 3 
in the expansion. The elements in the fourth row or column, not given above, 


are as follows. For the (w-+w,) contribution, 
(63a) (His) /(¥m) = 7(105/16)(wtoy)/(v), Hh = He = Ha; = 0. 
Also 

Ai = 0, 
(63b) H38.43,44/(/2 (v)) = 15/32, 309/128, 5657/1024 respectively, 
(63c) a 42,43,44/ (Ye) = 35/16, 165/32, 1077 /128, 2957 /256, 


BVM) YEN) 
Tass "#2 "IS "Fa "7 ee oS te ee 
+3(3_m)(5-m)(m-+1) : 


2 
jn) ei(f-n) -n)onen 
+1(3—m)(m-+1)(m+2), 
= saa") (3m) (Gm) +40G-m) Gm) orn 


+3(3-m) tiie 8)4 gmt 1) (m+2)(m+3). 


2 


Results were also calculated from the expansion up to r = s = 2. The values 
deduced from the 3 by 3 and 4 by 4 matrices are compared and tabulated 
in the Appendix, for variations of the electron collision frequency with 
neutral particles of the form v~’, v—?, v~!, v°, v, v?, and v3 (m = 0, 0.5, 1,..., 3), 
for various ratios of (vm)/{ve,) and for Z = 1. In all cases, the functions* 


e( (ve) £22) 


and h (of the same parameters) are tabulated and also plotted in Figs. 2 to 8a. 

s, all curves approach 1. Thus, one can add the 
equivalent electron collision frequencies, due to the two types of encounters 
(electron-ion and electron—neutral particle), and electron-electron collisions 


*The g and h functions for electron conductivity are denoted by gg and h, in the Appendix. 
Results for ym « v** are also tabulated in the Appendix based on the 4x4 matrix. 
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Fic. 2. Variation of g and h functions for ym « v~*. Legend: 


3 “3 
— Svs (vm) (ser v3, 


(vei) = 4(2r)'/2n4In A ( e y 


" ’ 
4reo 


“Bm (RT)? 
(vg) = (vm) + (vei); 


ouFen = ME (GopiccaN ; 
m \(vg)gt+j(wwr)h 
4X4 Matrix 


Curves 1 2 3 4 5 6 7 
(vm)/{vei) 0 0.1 0.5 1.0 5.0 10 100 


Curve 8 is exact evaluation for (vm)/(vei) = © using 
Dingle functions. 





do not contribute. In this limit, one can sum the resistivities due to neutral 
and ion effects. For example, in the absence of a d-c. magnetic field, the 
conductivity is 


(64) 1/o = (m/ne*) 2» [re +I] 


where the summation extends over all heavy particle species in the gas 
including ions, where the equivalent collision frequency is obtained from eq. 
(19a), namely 
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Co 
PT 


iO +10) 2 4 6ltsé 
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Fic. 3. Variation of g - h functions for vm « v~*. Legend: 


5 (vm) (sr) = 


42 2r)/2n, In A ay 
3m'/2(RT 3/2 \ Are 


(vg) = (vm) + (vei), 


(vei = 


= 7 (, ae 
ouFon in \Gpetiote)h)’ 
4X4 Matrix 


Curves 1 2 3 4 5 6 
{vm)/ (vei) 0 0:1 .025'..120 5:0 200 





Curve 7 is exact evaluation for (vym)/(vei) = © using 
Dingle integrals. 


ae ‘a Lote re af 
(64a) (79) = ( i 2 s = - v Ve dy 


and » is the elastic collision frequency for momentum transfer. For a single 
neutral species at high frequencies 


: eee (vet) + (Ym) — jw ne y v.)— 40 
(645) o = ne] teal ites) nes (Yet) + (Ym) — jw). 


At the opposite extreme of low |w-bw,|/(vg), the g and h# functions also 
approach limits which, however, differ markedly from 1, and depend on 
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_3vVr Noe. 
= (str “ 
(mi) = 


4(2m)'?ny In A (_ <\' 
3m'/*( kT 3/2 dre)’ 
(vg) = (Ym) + (ei), 


E ,2 
nF Jou = = ( ! 


Ggtiwtw,)h) * 
4X4 Matrix 


m 


Curves 1 2 3 4 5 6 7 
(vm)/(veid 0 0.1 0.5 1.0 5.0 20 100 


~ 





Curve 8 is exact evaluation for (vm)/(vei) = © using 
Dingle integrals. 


(vm)/(ve). No simple relationship exists for these limits unless the plasma 
is only slightly ionized or strongly ionized. In the former case, g and 4 approach 
the limits given by eqs. (20d) and (20e). One notes from eq. (20d) that for 
a slightly ionized gas and w = 0 


i in... : Lee) 
(65) om = 8(0)v) = i/(2 dv 30m/ 


which is the effective collision frequency for low w-tw,, derived in eq. (126). 





CANADIAN JOURNAL OF PHYSICS. VOL. 39, 1961 


10 +O Il 1-2 
——=—> h 
Fic. 5. Variation of g and h functions for ym a constant. Legend: 
im = (vm), 
(ng) = ACen s (3) 
re 3m'/2(RT )3/? 4reo) ’ 
(vg) - (vm) + (vei), 


. ne 1 
ouFjon = — | -—— —;) : 
- ; © (wasicceE 


4X4 Matrix 


Curves 1 = 3 4 5 6 7 8 
(vm) / (vei) 0 0:06 0:1 0.2.05 .10 688 0 


Curve 9 (g = h = 1) is exact for (vm)/(vei) = ©. 


If the gas is strongly ionized and Z = 1, the low frequency limits, namely 
g = 0.5093 and h = 1.2077, obtained from the 4 by 4 matrix, are in between 
the exact value g = 0.506 given by Spitzer and Harm (see Table Ila), and 
Landshoff’s values g = 0.5128, h = 1.207 (see Table I1d). In general, for 
any degree of ionization, the graphs show that if y, « v~’, gand dare in between 
the limits for the slightly and strongly ionized gas. On the other hand, if 
Ym & v" (positive powers of speed), g and hk vary from the limits for the slightly 
ionized gas to approximately 1 at (vm)/{vea) somewhere between 1 and 10, 
and then g and h deviate more from 1 as (vm)/(ve) decreases, until they 
approach the limits for a completely ionized gas. The explanation for this 
variation is the fact that vg, equal to v_+ve1, is approximately constant when 





SHKAROFSKY: TRANSPORT COEFFICIENTS 


io 610 2 
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Fic. 6. Variation of g and h functions for ym « v. Legend: 


_ es m \¥2 
in = ==" (Ym) (st) vy, 
A(27r)'/2 n In A e \2 
(08) = mi re) 
(vg) = (vin) + (vei), 


our jon = ne ( 1 
m \(vg)g+j(wtup)h] 


4X4 Matrix 


Curves 1 2 3 4 5 6 7 8 9 
(vm) / (vei) 0 0060.1 62 65 1060 8& @ 1@ 


Curve 10 is exact evaluation for (vm)/(vei) = © using Dingle 
functions. 


(vm) = (ve) and when v» varies as a positive power of speed. In other words, 
since va decreases with electron speed as 1/v* and vy, increases as v’, there 
exists a ratio, (vm)/(ve1), for which the sum of the decreasing and increasing 
velocity function results in a more or less constant vg with respect to velocity, 
and when this occurs, g and # are approximately 1. Consequently, g and h 
shift towards 1 and then deviate from 1 as the degree of ionization in the 
gas increases. For constant collision frequency, Fig. 5 shows how g and h 
approach 1, the limit for the slightly ionized gas, as (ym)/(ve) is increased. 

If the gas is slightly ionized ((vm)/{ve) = ©), the values of g and h should 
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4X4 Matrix 
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Curve 11 is exact evaluation for (vm)/{vi) = © using Dingle 
integrals. 


agree with calculations based on the Dingle functions (see Subsection (a)). The 
agreement is good for ym « v~’, v~!, v®, and v, but only in the range of 
|w-twp|/{vg) less than 1 if vy, « v~, and in the opposite range of |w--wp|/(v,) 
greater than 1 if vy, < v? or v’. The disagreement indicates that the conver- 
gence of the Laguerre expansion is inadequate for low frequencies when the 
power of velocity is equal to or greater than 2, and for high frequencies when 
the power is equal to or less than —3. Fortunately, the differences for low 
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Fic. 8. Variation of g and h functions for ym « v*. Legend: 
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holes a 2x)? m4 In J *( e ) 
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: ne? 1 
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4X4 Matrix 





Curves 1 2 3 4 5 6 ' 8 9 
(¥m)/ (vei) 0 0.05 0.2 0.5 1.0 5.0 10 20 100 


Curve 10 is exact evaluation for (vym)/(vei) = © using Dingle 
integrals. 


frequencies occur mainly in the / function, which multiplies the smaller factor 
|w-tw,|, and the differences for high frequencies occur mainly in the g function, 
which multiplies (vy,), again the smaller factor in this case. In the former case, 
h may, however, be important in estimating the width of the cyclotron 
resonance curve, and in the latter case, g is required for estimating the slight 
attenuation of the wave. 

Usually, Coulomb effects become important when (ym)/{va) = 10, which 





CANADIAN JOURNAL OF PHYSICS. VOL. 39, 1961 


100 


50 


30 
20 


10 


5:0 
3-0 
20 


+ =]. 2 od 


: 


> 
~ 
= 
> 























02 O06 10 
a 2 g 
Fic. 8a. Continuation of curves 9 and 10 in Fig. 8 for ym « v*, Curve 9 is for (vm)/(vei) = © 


evaluated using the 44 matrix. Curve 10 is exact evaluation for (vm)/(vei) = © using Dingle 
functions. 


corresponds to a degree of ionization of 0.01% to 1% for normal gases and 
temperatures. The ratio (v,)/(ve) becomes of this order at high temperatures 
due to the increase in ion concentration and also at very low temperatures 
due to the 7~-*/? variation in (vq) (see eq. (23a)). It can also occur in a pre- 
ionized gas, such as the F layer of the ionosphere. 


4. ENERGY FLOW AND TRANSPORT DUE TO GRADIENTS 

In this section, we shall investigate and provide results for the electron 
current due to gradients in electron density and in temperature, and for the 
energy flow due to these gradients and due to an electric field. The isotropic 
part of the electron distribution is assumed to be Maxwellian (not a displaced 
Maxwellian), and the analysis simplifies somewhat because of this assump. 
tion. Of course, for this assumption to be accurate, the gradients and electric 
field have to be sufficiently small in magnitude for linear theory to apply. 
Only transport due to d-c. forces is considered. The electric field in this 
section refers to an applied d-c. field.* 

The electron current flow 


(66) J = nev 


*The electric field, arising from induction due to the motion of the diamagnetic plasma, is 


omitted 
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is given by eqs. (76), (10a), (105), and (16). If we denote the intrinsic velocity 
by 







(66a) w=v-Vv 
then, since w = 0, the electron current due to thermal velocity is zero. 
(66d) Ju = new = 0. 






The pressure tensor is defined as 






(67) {p} = nm{vwv}. 
By eqs. (17) and (9d), the average is 
(67a) {p} = p{I} + an integral over {fz} 
where 
(67d) p = (4nm/3) f fou'dv = nmv* /3 
0 





and {I} is a unit tensor. 
For a Maxwellian distribution, 






(67c) b = (nm/3)(3kT/m) = nkT. 





Since we neglect {f:} terms in the expansion of the distribution function, the 
pressure tensor in eq. (67a) is diagonal. 
The thermal pressure tensor, {pu}, is defined as 





(68) {Pu} = nm{ww} = nm{vv} —nm{vv} 






and the non-divergent thermal pressure tensor {p,}, is defined as 






(69) {pia} /nm = {ww} —43(wew){ 1} 





(69a) = {vv} — {vv} —4o°{1}+4(v-v) {I} from eq. (66a) 
(69d) = {vv} ~4y*{ I} if the quadratic terms are neglected 
(69c) = 0 by eqs. (67) to (670), if {f:} terms are omitted. 






Thus, this analysis cannot yield {pj}. Since {pj} is related to the viscosity 
of the electron gas, this coefficient cannot be obtained. 

For a Maxwellian distribution, the electron energy is u« = myv?/2 = 3k7/2, 
and the electron current flow from eq. (74) is related to the electric field, 







to gradients in temperature, density, or pressure, by 






(70) J = {6} E—e{D} -Yn—{[e} WT 





(70a) = {6} *E—e(D} -wp/kT—{2'} 07, 





where eq. (67¢) is substituted for p, and where 






(71) {'} {e} ~e{D}n/T. 





lhe matrix {*’} is the current flow matrix due to thermal gradients at constant 
pressure and {#} is at constant electron density, {*’} is convenient for con- 
sidering Vp and {+} for Yn. 
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The total energy flow due to electron motion is 

= —— 16 

(72) H = mnv'v/2 


where from eq. (13), subject to a Maxwellian distribution, one has again in 
terms of the three forces E, Yu or Vp, and VT, 


(72a) H = {u} -E—e{Q} -Vn—{K} -vT7 

(726) = {yu} -E—e{Q} -Vp/kT—{K’} -VT, 

where 

(73) (K’} = {K} —en{Q}/T. 

The quantity {K’} is the energy conductivity matrix at constant pressure 
and {K} is at constant density, the former useful for Vp and the latter for 


Vn. 
The thermal energy or heat flow is defined as 


(74) Hy = mnw'w/2 = mn{v’v—v' v—2 {vv} ev+2(vev)v]/2 


since 
WwW =V-V, w = 0, and (vev)v = {vv} ev. 


Neglecting quadratic effects, eq. (74) reduces to 
(74a) Hy & mn[v°v—v' v—2 {vv} ev] /2 
and omitting the {f2} contribution, one has from eq. (17) 
(vv} = (0°/3) {1} = (kT/m) {1} 
and as a result, eq. (74a) becomes 
(746) Hy & mn|v’v —5v°v /3]/2. 
Substitution from eqs. (72) for H and eqs. (66) and (70) for J yields 
(74c) Hy = H—5kTI/2e 
(74d) = {un} -E—e{Qu} -Vp/kT—{Ku} VT, 
where 
(75a) {un} = {yu} —5kT{ 6} /2e, 
(750) [Qu} = [Q}—5kT{D} /2e, 
(75c) {Ky} = {K’} —5kT{2'}/2e = {K} —en{Q}/T—5kT {2} /2e+5kn{D} /2. 


The above definitions only deal with the electron contribution. Other co- 
efficients can also be defined, which are concerned with electron plus ion motion 
(Rosenbluth and Kaufman 1958). Although ion flow is important in certain 
cases especially in the net energy flow, it will not be considered here. 

In the calculations, we restrict ourselves to H, for which the analysis 
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applies exactly, rather than to the approximate expression for Hg. In experi- 
ments such as those performed at M.I.T. (Bekefi 1960) and at the University 
of Illinois (Goldstein and Sekiguchi 1958), the total electron energy flow 
rather than the heat flow is measured. Furthermore, one usually measures 
electron density (m) in discharge plasmas, rather than electron pressure (p). 
For this reason, the coefficients {K}, {e}, {6}, and {yu} (for E, Vu, and V7) 
are chosen for the calculations, and the other coefficients can be obtained 
from these by relations given above or those deduced below. 

For a Maxwellian distribution the following additional relationships exist 
between the coefficients.* 


(76) {[D} = kT{6}/ne?, {Q} = kT {u}/ne?, {Qu} = kT {un} /ne’, 
(77) lun} = Tie}, or = {y}—3kT{6}/2e = Tf}. 


The proof of these relations for any degree of ionization follows from eqs. (66) 
and (58) relating f; to fo. Let us define (as in eqs. (26)) the following complex 
quantities. 


(78) te = fitifiys jit = fiz—if,, 
(78a) V, = 0/dx+jd/dy, V, = 0/dx—jod/dy, 
(78d) a, =a;+jay, dy = ar—jay. 


The equations, analogous to (27) for this case, become 
(79a) { —jont+rg—[O]}f: = —a,0fo/dv—vVifo, 
(796) {jon t+vg—[O]}fu = —ay0fo/dv—vV.fo, 
(79c) {ye—[O] }fiz = —a20fo/dv—vV fo, 


where [O] is a linear integrodifferential operator which can be deduced from 
eq. (56). If the distribution is Maxwellian 


fo = n(m/2xkT)*? exp(—mv?/2kT) 


ll 


and 
(80) Ofo/dv = —(mv/kT)fo, 
(80a) vVfo = (vfiVn)/n+vVT (0fo/dT) 


(uf Vn) /n+ofiV T[—3/2T + (mv?/2kT?) ]. 


Suppose that eqs. (79) have been inverted, by some ingenious method, with 
the result that the distribution functions f,,,, are given by some matrix 
operator of velocity {O’}, acting on the right-hand side. One can then multiply 
alternatively by (4mev'/3) or (2rmv5/3), integrate over velocity, and obtain 
the coefficients of J and H. Upon comparing coefficients with eqs. (76) and 
(13), one finds that 

*The three formulas in eq. (76) are known as the Einstein relationships, and eq. (77) is 


known as the Onsager relationship. (See de Groot, Thermodynamics of Irreversible Processes, 
Interscience Publishers, Section 61, 1951.) 
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{6} = (4me"/3kT) fio }fov' du, 
(42 /3n) f {O'}fov' dv, 
(2ne/T) J 10%) (—14- mv? 387) fe dv, 


= (2ume/3kT) { {0'} fov® dv, 
{Q} = (2am/8ne) | {O'} fov' do, 


{K} = (4m/T) J {0% (—14-mv?/3k7)fe08 de. 


By inspection, and from eq. (71) defining {e’} and eq. (75a) defining {yx}, 
the Einstein and Onsager relationships in eqs. (76) and (77) result. 

The matrix elements of the various transport coefficients can be shown to 
be of the form given in eq. (18), by using an argument similar to that in 
Section (3c), and by considering, consecutively, transport due to E with no 
gradients, transport due to Vm with no V7 and no E, and transport due to 
VT with no Vz and E forces. Physically, this may be impossible (e.g., VT 
gives rise to Vm); analytically, they may be considered separately and later 
added, since the theory is a linear one. Furthermore, one can define for any 
degree of ionization, particular parameters, g and h, which multiply (»,) and 
(=:w,) in each coefficient (e.g. g, and hf, in the 7 coefficient, similar to g, and 
h, for the conductivity). To be consistent, the definitions of (ve) and (vm) 
are left unchanged (see eq. (19a)). This requires, however, that at high (wp), 
g approaches limits differing from 1. The limits depend on the degree of 
ionization and the power law variation of vy, with velocity. The function h 
approaches | in all cases. The transport coefficients are normalized in such a 
way that for constant vy, and a negligible degree of ionization, g and h are 
identically 1 for all (-:w,) values. This can be noted from eqs. (10) and (14) 
and from the following definitions for the coefficients. 


9 


ne 


82 Fjon = ——~___ 
( a) oe M ({Vg)So= jor Ne) 


(82d) -- (DirFjD 21) ’ 


___—nek 

mM (rg gr juph,) ’ 
____bnekT 

2m ((¥g Su jonlty) 


(82c) TuFjra = 


(82d) MuFjun = 


(82e) om (Qi1FjQa21), 
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5nk’T 





82 oe one 
(82f) ities M ((Y_) 8x +juphx) 


The (33) matrix elements are obtained from eqs. (82), by letting w, = 0. The 
g, and h, functions are identical with those derived in Section 3. Thus, for a 
feebly ionized gas, eqs. (20) apply, and for a partially and completely ionized 
gas, eq. (59) results from the Laguerre expansion. 

The g and / functions for the other transport coefficients in the limit of a 
feebly ionized gas obey the following equations, which are deduced from eqs. 
(10), (14), and (82). 


m|(d rv? ( 1 2 ‘| 
(83a) 3h iS ae) Wp aT vat -bont : 
3 e 1) = (4. a) L 
a dT Vm top” a (Ym)£r < AT Ym" +a" a h, ; 
(830) + 2 ie .) | 
~ UBNRT] Lom top a) \im toy / 2 
f (_9#) ie E es) 1 
ins Vm twp a (Vm) £u + Vn tw” an ; 
xo) Lm! ( d _ rao 7 ( do | 
(83¢) 30 | (Se Ew ef te a 


ir (< at) iil ta ‘= a e) - 
7 dT Vm top” a (Vm )£x - dT Yn tw a hx . 


If vm varies as a power of velocity, viz. 


— ¥%m)T(5/2) =. 
"ma = T((5+r)/2] \QkT 
then, g and h, in eqs. (83) at high and low cyclotron frequencies, approach 


limits given by eqs. (84). As w»)/(vm) approaches infinity, the limits for the 
components perpendicular to the magnetic field are 














(84a) h-=h,=hy = 1 
and 
(846) ge = (r+2)/2, Bu = (r+5)/5, Bx = (7 +5) (7 +4)/20. 


For the (33) components, and as w)/{v,) — 0 for the other components, one 
has 


£15 /2 _ eS) 
(84c) g- = rom —, r(2 a” 3 


Coed Tear 
8+ = &62/(2—1), h, 


he(1—r)/(1—r/2)’, 











ll 
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GS") 
2 
[ey 
7—2r 
20g, 1G )r (752 2 1S") eee 
G=n 47)’ T\y Ga 
[nf 5*) 


Numerical values for these limits are tabulated in the Appendix. One can 
also calculate the g and h functions given any w»/{vm) ratio, from the Dingle 
integrals (1957). These calculations, however, were not performed, except for 
the g, and A, functions. 

In the limit of a completely (singly) ionized gas in zero magnetic field, the 
various coefficients can be deduced from Spitzer and Harm’s results (1953), 
which include electron-ion and electron-electron collisions. Spitzer and Harm 
evaluate 7’ (instead of 7), u, and K’ (instead of K, see eqs. (71) and (73)). 
One finds that in terms of Spitzer and Harm’s corrections, namely y g=0.5816, 
yr = 0.2727, 52 = 0.4652, and 67 = 0.2252, the g functions are given by 


Zu = 152/25662 = 0.3957, Zo = 32/32yz = 0.5064, 
gr = 1/16yrz, and gx = 32/6467. 

From eqs. (71), (73), and (82), at w, = 0, 

1/g, = 1/get+1/g» yielding g, = 0.2973, 
1/gx = 1/2g,+1/gx yielding gx = 0.3581. 


(84e) fn = 


(85a) 


(85d) 


For a partially ionized gas and any w,/(vg) ratio, one can resort to the 
Laguerre expansion, using the method of Section (3e), to calculate the g and 
h functions. The analysis is now presented. All the equations in Section (3e) 
up to eq. (48) are correct. Equation (43) should read 


(90) 24 (M 4) py Zein vtw) = (L) Boar 


where the Laguerre polynomial, ie = 5/2-W, 
W = mv?/2kT. 


[lL—Li(W)]VT/T = (3/2—mv?/2kT)VT/T, 
and eq. (80a) for Vfo is substituted into eqs. (79) relating f,,, to fo. As a result, 
upon applying the orthogonality relation of eq. (38), eq. (44) is changed to 


Ss cE <n out jo» Tl (s+5/2) 
0) (GT inf Sat rG/2)r(s+1ee" 


=—)>) p.(i-+1191-441) 
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In the above equations, the forces are defined as in eqs. (78), that is, 
(87a) Vn = (Vn)iu = VnztjVn,, VT = (VT) .4 = VT,+jVT,, 
E=E,, = E,+jE,. 


Equations (45) to (54) remain unchanged. In eq. (55), corrections have to 
be inserted (see footnote p. 1644). Finally, eq. (55d) is 


(88) 


The matrix (A) is given in eq. (55c). If |Aoo|, |Ao:], and |Ay| are the deter- 
minants of the (00), (01), and (11) minors of (A), then one has 


(88a) si ee |Ao:| ei 


(v)L [Al a Ts ia 7 


(880) fi re Leif vn ”) ciealSe 


(v) |A} . if" 2m = 
The electron current is related only to po by eq. (57), namely, 
(89) Ju,, = nekTpo/m 
where J,,, = J:*#-jJ,. The energy flow can similarly be evaluated from eq. (13): 
Hu: = (2am/3)fv°fu,.dv = (2am/3)[fw' dD) psL.(W)dv 
= (4nk°T’/3 am) >, pJeE"wW’L,(W)dW 
= (4nk°T*/3./mm) >, p.fW*’e "L.(W)[—-Li(W) +5/2]dW . 
Applying the orthogonality relation given in eq. (38) 
(90) Hus = (Ank?T?/3-/xm)[— pil (7/2) + (5/2)poF (5/2)] 
= (Snk*T*/2m) (po— pi). 


Thus, from eqs. (88) to (90) and eqs. (74) and (13), the coefficients are given 
in terms of the determinants, by 





( at ne |Aoo 
(91a) O11Jo21 mv) [A] 9 


eT the 
(91d) Dat + (A| ° 


(91c) — toh {lel 5 aa) 


mv)\ |A| |A| 
(91d) a ne Beh (Ihe lel) 


tut der = 2 m@)\ [al [al 


° 5 kT Aoo Aoi 
aa QutjQn = ge (latte) 
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: - eige _ Smk*T( lool , 7 |Aos| atest) 
ae Kuikn = BH [a| 72 [al 2 [al 
The (1i) and (22) elements arise independently by addition and subtraction 
of the individual (-+F) equations. The (33) elements result by equating wy to 
zero. For example, 

icc a sees * [Aco (wy = - 0)| 

™ re |A(wp, = 0) 

Application of eqs. (71), (73), and (75) yields for the {e’}, {K’}, {un}, {Qu}, 
and {Ky} coefficients 
a, 5 nek |Aoi| 


‘ ’ o, (wi juoa 
(91g) TiJT21 = = (QuFjQa)a 2miv) |Al ’ 


z 


vg weaete 25 nk® r | Aoa| | Aul 
(91h) KiuFjK = . sae (ial (Al ++ jal]? 


5 nT ~ 
- 7 ears [A] ° 


From our definitions of the g and / functions in eqs. (82), one has 


) lA 
(92a) go = a Re (at) 


(912) (KiiFjKa)a 


i _ (Jar) 
(925) h, = ae Im [Acol/ ’ 


(92c) ye —— ih, 
(y Vg) 3 
doe +2 |Ao:| 


\ 
(92d) p= ial Bas , 
op 5 ‘ 
|Aoo| +5 | Aoi| 


wos ) R(_—__14l ), 
(92e) ~ (yg) Rel | Moo] + | Aor] 


(vy) ( L 2), 
92 = TALS 
( f ) +p, Im |Aoo| + | Ao1| 
(92g) c - 2 — 7 2 5 
vs) lawl +2 |Aoi|+5 |Aia| 


(92h) hx = _) Im ae: | . 
tap 7 5 
|Aoo| +5 |Aoi|+5 | Aai| 


At high d-c. magnetic fields, wp, >> ({vg)), the transport coefficients per- 
pendicular to the gradients or electric field, can be obtained from the limiting 
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values approached by the various determinants. In this case, the diagonal 
terms of the (A) matrix in eq. (55c) dominate. Write for convenience 


7; ne 

BB 
(93) (A) =]. ; Gy =; 

’ ; ‘ D 


Then 


(93a) |A| > ABCD..., |Au| > ECD... ’ 
|Aoo] > BCD..., |Aun| > ACD. 


Hence 
| Aoo| /| Al —?- Li A 
(93d) |Aoi|/|A| + E/AB > E/Re AB, 
|A1a|/|4| > 1/B. 
Furthermore, in this limit, it is easy to generalize the results to arbitrary 
variation of vy, with velocity, that is, not only a power of velocity. Note that 


from eq. (45a), to the first order in ¢ and 7, 


a —5/2 





Ps ici all oll ee. we 7 2) | 
vpn € iH ie )+¥ 5 te... 


me OESE/2+ 50/24 2500/8) Cy ayy — Weta) + Wt] aw 
1'(5/2) 


pei e (Wire) | gE 5(W'm)a_, (W¥m)a | 
a m)y 1+ +n) 3” (W»,,), +én 4 (Wade “+p i 


where W = mv?/2kT, 


- f.« v'ra) a y 
(94a) (Ym) = (3 as 2(W vm)a/3 


for a Maxwellian distribution, and an average over a scalar function of velocity 
is defined in eq. (9d), namely (¢), = 4 f frpv'dv/n. Thus, the generalized H™ 
matrix is found from eq. (94) to be 


1 se. (Wire) 

2 (Wm)s 
On em _ 5 (W'vm)s 25 5(W'm)s (Wrn)s | 
(95) Az = (vm) # a E (Wvm)at (W ¥m)e 


This He matrix is now added to H*! in eq. (49) and to H% in eq. (53), and 
the elements 4, B, and E in the (A) matrix, eq. (93), can now be identified. 
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A [Aco] — — 3, Me) 
(v) [Al a wp’ 


L [Aor _ _ 34a) _ 2 (vm) | $n 


Sink = 
IWp 


9 


Wb 


~~ (W tae | ; 


(96c) a(n) 3 44/2 (v) 


o.* = 
25 _5(Wrm)a Ym )a (Wire) | 
+m) 22 (W vm)a (Wm)at) 


The presence of the »/2 term in eq. (96c) indicates the only contribution of 
electron-electron effects to the high frequency limits of the transport co- 
efficients, and only the energy and heat conductivity coefficients containing 
| Ai;| are affected. Note also that these results are independent of the Laguerre 
expansion. This is so since, if all the terms of the Laguerre expansion were 
included, the results would be exact, but eqs. (93) show that the only terms 
from the infinite expansion, which are of consequence here, are the (00), (01), 
and (11) elements and these are given exactly in eqs. (96). Hence, identical 
results can be expected from any other method of solution (Kaufman 1960)* 
of the integrodifferential equations in the limit of high magnetic fields. 
Substitution of eqs. (96) into (91) yields 


(97a) Outjoon = Me (DutjDn) = af (aj ;4 Ye) wo), 


: ‘Lee jee] 1 on _Wra)s) | 
aia ere a a Se I 


Mp 
(97c) wiutjua = ae (Qiu+jQ21) 
5mekT J. ., 2¢ve)| , (vm) (W*m)a 
og Moy \ait® 5ap | 1—£ (-@ rae) It, 


Knu+jKu = bub T Flaite +; Ye) 2[142ya¥ ) 


mu (Y%) 
tea){ _ _3(Wom)a 2(Wrm)e) 
Ge) "Wade? (Wrned ID” 
Thtjra = > F uncejuene = =, (Qi+jQe1)n 
nek [ 3 a) ( ~(Wea)s) 
“a [Ba oe (Wn). 1” 


*Kaufman obtains identical results for a completely ionized gas, without the Laguerre 
expansion. The equations which correspond to his are indicated in the following footnotes. 
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hie EES pe S| _ 5 (v) 
(97f) KntjKn =~) +5tion 142/27 
(ral( _5(W'm)s 2(W'm)e) | 
FEV Ura” Gear 


* eo ee ks tr) 18 v2 ov) 
(97g) (KirtjKa)a = ty ot = te) 


vm) (3_ (W'm)a 1 (W'¥m)s | 
TG.) (2- (Wma 5 (W *=)s) 
(From eqs. (94a) and (51), (vm)/(vg) = (W vm)a/(W vg), and (v)/(vea) = n/n4Z?*.) 


Upon comparison with eqs. (82), it is seen that all coefficients except 7’, ux, 
and Quy, are proportional to 


(98) Ej / Ney +g (r_)/h*ap”. 


The h functions are 1 for o, 7, u, K and 2 for K’ and Ky. The limiting values 
of the g functions are 


(98a) g,=1, 


fae 5— a) | —(Wn)s| 
(98d) g, 0.5 (y.) 1 (Wry? 


( _ J _ Ym) _ (W'vm)s \ 
(98c) & = 0.4) 1 E aa It. 


(%«) (W vm)a 


” 2 ) (rm) | _ _ gq (Wms Wra)s | 
(98d) = 0.14+- S ta)’ Ge 0.1 03 iF ht” (a ; 


Ber _ Mao) tra) ae 2 Win)s | 
ae) eu eal oa eee 


. Exe ~~ (v) bra) | _ (W'vm)s ‘Vm)a (W"¥m)a |. 
Ow Beer Ty ay al ae 


If ym varies as a power of velocity, »_ « v’, then 

(99) (W*tm)a/(Wrm)a = (r+5)/2,  (W'm)a/(Wrm)a = (+7) (r+5)/4, 
and 

(99a) g. = 1, 


— xf —1E (+2) (lom)/ oa) 
(99d) 0. 5 - 1+ (Vm) / (Ver) , 


< 1+0.5(r+5) ((vm)/{Ver)) 
& = seis 1+ Gm) /(va) 


0.14 (x/2n/5n,Z° DAL +5) (+4) Ym) /20(ve1)] 
(99d) =™ 1+ (Ym)/ er) 


(99c) 
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As an example, let us consider the transport of a completely singly ionized 
gas perpendicular to a strong magnetic field, (wp/(ve)) >> 1. After inserting 
the coefficients in eqs. (97) into eq. (70a), the electron current becomes* 


(100) Jy: = JzFjJ, 
— (jneE,,,./B —j (VP) u,1:/B) = ((v,)m/eB?) (neEy..— (VP)u,t 
+3nk(VT)y,1/2) 


where w, = |eB/m|, and the (u,t) components are defined in eq. (87a). Similarly, 
the energy flow denoted in eq. (726), is 


i laa j5 nkT ORT 9 nk° rT ) 
(1002) He ~ H,*jH, > +(8m a — FE, t ~“j25 eB (VP)u, t SIS 9 a (vr du, t 


cE (neEy,1— (VP vet (0.5—V2)nk (Tw, 


and finally, the heat flow Hy = H—J5k7/2e, given in eq. (74d), reduces tof 


(1005) (Hud) = s/e (VT), - 3 nek, 


+5 (Wp)ui— (V2+13/4)nk(VT a, ). 


The j multiplying some terms signifies that the flow is at right angles to the 
force, in addition to the magnetic field. Note that the Onsager relationship 
is satisfied, that is the coefficient of E in Hy, namely ug, is equal to the co- 
efficient 7’ in J. 

If the collision frequency is negligible and no external electric field is 
applied, one obtains the simple well-known relation 


(100c) J, = VpXB/B? 


from eq. (100). 

Calculations based on the Laguerre expansion, up to a 4 by 4 matrix, are 
given in the Appendix. The g and h functions are tabulated versus{ the 
variable parameter |-tw,|/{»,) for components perpendicular to the magnetic 
field. For components parallel to the magnetic field, the parameter has the 
fixed value zero. The results are compared with 

(1) Spitzer and Harm’s values for w, = 0, a completely ionized gas, and 
Z = 1 (see eqs. (85)). 

(2) Spitzer and Harm’s results for w, = 0, a completely ionized gas, and 
Z = 2, which also applies (see eq. (62)) to w, = 0, (um) = (Wet), ¥m & ¥~* and 
Z=1, 2 = nx. 

(3) In the limit w, > ((vg)), the results are compared with eqs. (99). 


*Compare with Kaufman (1960), Section IT, eqs. (54), (52), and (104). 

tCompare with Kaufman (1960), Section II, eqs. (90), (144), and (145). 

{This variable parameter is written in the Appendix as |w+e»|/(vg), in order to apply to 
the considerations in Section 3 as well. For application to this section, set w equal to zero. 
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(4) Inaslightly ionized gas, (vm)/(ve) = ©, and if w SS ((vm)) the results 
are compared with eqs. (84). Exact evaluations can then also be performed 
for all w, using the Dingle integrals. Since these calculations have not been 
done (except for g, and h,), one cannot immediately predict in what ranges 
the Laguerre expansion converges well. 

The above results are identical with those evaluated by Kaufman (1960) 
from transport theory, in the limits of w, > ((vq)) and (vm) = 0. The pre- 
sentation here bridges the gap between Kaufman's analysis and Spitzer and 
Harm’s results for zero magnetic field. It also generalizes Landshoff'’s and 
Spitzer and Harm’s calculations to include neutrals, thereby extending the 
applicability of the theory to partially ionized cases. A limitation is the con- 
vergence of the Laguerre expansion in certain ranges of the parameters. For 
the electronic conductivity, the convergence is inadequate when | (w-w»)|/(v_) 
is less than or equal to 1 if », varies as a positive power of velocity, and when 
|(w--wy)|/(vg) is greater or equal to 1 if vy, varies as a negative power of 
velocity. Of course, for very large |(w-tw»)|/(vg), the exact limits are always 
reached. 
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APPENDIX 
TABULATION OF THE g AND h FUNCTIONS 
(For Z = 1, variable (vm)/{ve1), and vm « v~%, v~’, v—!, v*, v, v8/?, v?, v9) 


Ze, he functions are versus |w-tw»|/((vg)). Other g and / functions are versus 
|+-w,|/({vg)). Equations (82a) to (82f) define the transport coefficients in 
terms of the g and h functions, and eqs. (70) and (72a) give the electron 
current and energy flow in terms of the transport coefficients. 

Also, w, = |eB/m|, w = angular radio frequency, 


(¥%q) = Wm)-+ (et). 


If vm = cv’, then 
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NORMAL MODES OF VIBRATIONS OF CRYSTALS! 


I. V. V. RAGHAVACHARYULU 


ABSTRACT 


The general theory of the normal modes of vibrations of crystals as given by 
Max Born and von Karman has been analyzed group theoretically. This is 
illustrated with reference to the diamond lattice. 


INTRODUCTION 


Group theory has been successfully applied to crystals in order to classify 
the normal modes of vibrations of molecules. The extension of this method 
to crystals was first tried by Venkatarayudu and Bhagavantam (1939) when 
they classified different modes of vibrations of crystals with respect to the 
representations of point groups. This is usually known as factor group analysis. 
Halford (1946) improved upon their classification by site analysis. Winston 
and Halford (1949) discussed the classification problem under different 
irreducible representations which they set up for space groups. As was later 
pointed out by Winston (1956), these representations actually correspond to 
point space groups in which the primitive translations are submultiples of the 
primitive translations of some other point space groups. Moreover, it is also 
recognized that site analysis is identical with classifying the normal modes 
of vibrations with respect to the underlying point space group or symmorphic 
group of the crystal. 

On the other hand, the reduction of the dynamical matrix of a crystal was 
successfully accomplished by Yanagawa (1953) group theoretically. He made 
use of the irreducible representations of G*/T which necessitated the con- 
struction of matrices that reduce the dynamical matrix. The underlying idea 
in this method is that a space group G which contains screws and glides (not 
simple) is a subgroup of a suitably chosen symmorphic or point space group 
G’. So the representations of G (not necessarily irreducible) subduced by the 
irreducible representations of G’ can be found. The matrices which Yanagawa 
constructed to reduce the dynamical matrix are precisely those which reduce 
the representations of G = O; (in his paper), obtained as suggested from 
G’ = Oj. In fact, it is usually difficult to reduce the representations of G 
subduced by the representations of G’. 

In this paper the reduction of normal modes of vibrations of crystals is 
studied with respect to the representations of space groups making use of the 
small representations of G*. The site analysis will be identical with the method 
suggested here only when the space group is a point space group. For this 
reason the method suggested in this paper may be referred to as Seitz’ analysis 
since he has initiated the study of representations of space groups. 

1Manuscript received November 4, 1959. 
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In actual application of group theory to the normal modes of vibrations, the 
irreducible representations I; of the symmetry group G of the dynamical 
system are to be obtained first. Then all the symmetry modes of vibrations of 
the dynamical system which belong to different irreducible representations 
are set up. Here the projection operator technique of Melvin (1956) consider- 
ably simplifies the work. The symmetry modes of vibrations are then used to 
get the normal modes of vibrations which depend on the forces acting between 
different atoms of the dynamical system. This method is directly extended to 
crystals. 


REPRESENTATIONS OF SPACE GROUPS 

The symmetry groups of crystals are space groups. Every space group 
contains as a normal Abelian subgroup the translational group 7. Its bounded 
one-dimensional representations A(k) are labelled by the wave vectors k of 
the reciprocal point space of the crystal and are given by A(k): t — exp 
2rik.t when the co-ordinate axes are along the Bravais axes of the crystal. 
Two wave vectors k and k’ are said to be equivalent when they differ by a 
full lattice translation in the reciprocal space. Obviously two equivalent wave 
vectors define identical representations of 7 (Lomont 1959, p. 201). 

Now, let us recall some important steps in the method of constructing 
irreducible representations of space groups by the little group technique (see 
Raghavacharyulu 1961, quoted as I in what follows). First, the nonequivalent 
wave vectors k of the translational group 7 are classified into stars {k} under 
the operations of G/T. Then the little group G* of the second kind belonging 
to one k of each star {k} and its small or allowable representations are found. 
Irreducible representations of G can be induced by all the allowable representa- 
tions of G* and all representations of the space group G are found exactly 
once in this way. 

Further, let us make note of some more miscellaneous results (R's) which 
are used later. 

Ri: If p is the number of nonequivalent wave vectors of a star {k}, then 
pN(G*) = N(G) and pN(G*/T) = N(G/T) where N(G) is of the order of 
group G. 

R2: The groups G*’s belonging to different wave vectors of a star {k} are 
conjugate subgroups of G and so isomorphic. 

Let D;(g) be the ith allowable representation of G* of order / and {D,(g)}? 
the induced (k,z)th representation I'(k,z) of G with respect to a coset decom- 
position of G = G¥g,+G*g.+ ...+G*g, where gi = E, the identity of G. 

R;: In the induced representation {D,(g)}" of G, excepting the elements in 
G*, all the other elements of G have zeros in the first leading minor of degree /. 

R,: In the representation I'(k,z), the characters y* (g) of the elements of G 
other than in the groups G*’s, for different k’s in the star {k}, are necessarily 
zero. 

Further, continuing the same notation as above let G** = g,G*g7! when 
G* = G*, Let A be an element of G*' and B an element of G** and the con- 
jugacy relation between the two groups G** and G*' be explicitly given by 
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B = g,Ag;'. Now, define the representation Dj(B) of G** by Dj(B) = Di(A) 
= D,(g7'Bg,) when Di(A) = D,(A). Let g be a general element of G and 
'ykr(g) = trDi(g) if g belongs to G* and zero otherwise. Then 


Rs :'W" (B) = Wi" (g-Agr') = 'WE(A) = 'VE(A), 
Re: ¥i(g) = 2» "Vt (g). 


STARS AND ALLOWABLE REPRESENTATIONS OF DIAMOND LATTICE 
It is well known that the diamond crystal belongs to the space group O; 
whose underlying point space group is O; and reciprocal point-space group is 
O%. The stars {k} of the irreducible representations of J relative to O) are the 
equivalent points of O} which are given in the International Tables for X-ray 
Crystallography (1953, see I). In it the space groups G*’s are also specified. 
A few of these stars of the wave vectors k = (k,, k,, k,) dropping k are 


b 4/mmm OW «40k HHO, 
g mm x03 3x0 03x x30 Ox} 40x 
£03 420 03% #30 0x3 40Z. 


These are referred to as crystallographic axes. So the irreducible representa- 
tions of T associated with these k’s are given by t > exp 4rik.t but not by 
t — exp 2z7k.t, for the wave vectors 000 and 333 which are given as equivalent 
points in the body-centered lattice O} define the trivial representation of T. 
The allowable representations of G*’s are calculated for the first wave 
vector of each star as given in the International Tables under O} by the general 
method illustrated in I. The notation adopted for the operations of O, is the 
same as that given in the International Tables with center at 000. If x, y, z 
are the co-ordinates of a general point with respect to the crystallographic 
axes, the symmetry operation of the space group is given by the co-ordinates 
x’, y’, 2’ into which x, y, z is transformed by the operation. In Table I the 


TABLE I 
Notation for the basic operators of Oj with center at 000 


| 


R 
ne 

LST | 
& 


| 


sxy Cy yex Cy xzy Ord yxz zd) ZYX yd 
xyS C, sty @§©6C,(3) yx C;(4) xzy Ordo yx Sis 2yF Sy 
XYZ 2 %2%y 3 Vox 3 xXZy 4c XS Feds 29x Sey 
L—yl—yi—z i t—st—xi-y Ss to-yt—-si-—2% Ss }—x}—2 3-9 Ga 
A—xityitz oz 4-2 i+xity Se g-yiteitx Ss t—-xitsity Cra 
ttxi—-yits oy 4tet—xity So ityt—-zitx Seo tt+xi-zity Ce 
Liyityi-g og, 4t+zt+teity Se ityttzi-x Ss i+xi+si-y Ce 
t—y 4—2% .—s Cray t—y 4x i+2 Cos a+y i—x A+z2 Cus i+y t+x}4-—2 Cais 
t—2 t-yi-—x Cua t—sityitx Cy i+st-—yitx Cy, t+eit+tyi-x Cy 





description of the basic symmetry operations of 0; along with the symbols 
for them are given except for translations and their products with the basic 
operations. The description of these operations is quite obvious. Table II 
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TABLE II 


Notation for matrices 


w = exp(277/3) 


e(x) = exp mix In = nth order unit matrix 
e(x) 0 1 0 
Jx)= (oO ~an)) a* (5 -1) 
J"(x) = iJ(x) J' =iJ 
. w 0 1 0 
K=(5 &) s= (3) 
0 1 0 1 
P(x) = Ci _ em G 0 
P'(x) = P(x+1/2) P’ = P(1/2) 


ers -1 0 0 
P;={0 0 1 R; = 0 0 1) 
r @.¢ e 2:3 


contains the symbols adopted for the matrices in the allowable representations 
of G*’s of Oj. Table III contains the allowable representations of G*’s. Making 
use of the Tables II and III the allowable representations of G*'s for different 


TABLE III 


Allowable representations of Gk’s for different wave vectors* 











Generating Allowable Number of 
elements representations representations 
Stars of Gk’s of Gk’s under each star 
a C3 1 , K ’ P3 
Ozd2 +1 ’ FP ’ +R; 10 
t +1, +1, +]; 
b Sex rt i 
G J a Oe 4 
a P » w&tP’ 
c Cs 1 ’ K 
Orxd, +1 ’ P 6 
t +1 , +12 
d Ses +S 9 
Oy P(3/4) 3 
e Cu ste(x), J'(x) 5 
Cy +e(x), P(x) 
if C; 1 > Ce 3 
Crd +1 , F : 
g Cz J 1 
Cy P'(x) 
h Oz +e(x) 4 
Oxdy +1 
t Coa +1 2 
j oz te(y+z) 2 
k Ozd1 +1 2 
l E 1 1 





*Distinct allowable representations of Gk’s which differ only in signs are given 


in a column. 
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wave vectors k can be explicitly written very easily. For example, the two 
allowable representations ['; and Tr, of the group G* of the first wave vector 
of the star d specified by Sy, ~ +S and o, — P(3/4) in Table III are given by 


ce 
y << an 
ls a. 
y as oe 


in terms of the generating elements S,, and o, of G* and the matrices given 
in Table II. Similarly, in the case of other wave vectors, the allowable rep- 
resentations of G*’s can be obtained. 


PROJECTION OPERATORS 

Once the irreducible representations of the symmetry group of a dynamical 
system are completely determined, there exists a general method of obtaining 
the basis functions belonging to a given irreducible representation. This method 
is based upon the construction of a set of projection operators, also known as 
idempotent operators, which project the appropriate basis functions when 
applied to a given set of symmetry functions and the construction of nilpotent 
operators which are useful in completely implementing the idempotent 
operators. 

The general theory of the method was first presented by Wigner (1944, 
1959). (For a recent survey of the method, see Lomont 1959, Chap. III, 
Section 9.) The less powerful methods of Eyring, Walter, and Kimball (1944, 
p. 189) are particular cases of this general method. Applications of. more 
general methods have been introduced by Nielson and Berryman (1949), 
Meijer (1954), Lomer (1955), and Melvin (1956). 

Let IT be an irreducible representation of G = (g) of dimension 1. Suppose 
we have a set of functions f* such that under all the operators of G the 
functions mix with each other according to 


A 5 xe Ste Siete, ot SMM os ead \g 


where & = [€:,| is the matrix corresponding to g in I’. Then f‘ is known as a 
symmetry adapted function belonging to the 7th row of the representation I. 
Now consider the operators 


P,(T) = z; Big 
g 


where $7; is the complex conjugate of $,;. The application of the operators to 
various symmetry functions transforming under G produce linear combinations 
of them which are the symmetry-adapted functions appropriate to the irre- 
ducible representation T. When z = j the above operators are called projection 
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operators or idempotent operators and the rest nilpotent operators. It should 
be pointed out here that the idempotent and nilpotent operators as we have 
defined them above are correct except for the group theoretical normalization 
factor 1/N(G). However, the linear combinations of wave functions produced 
by these operators will, in any event, require separate normalization. For 
this reason, we prefer the more simple definition adapted here. It can very 
easily be proved that for any symmetry function f of the group G the function 
L gal is the symmetry function belonging to the ith row of the representation 
iy 

In the implementation of the projection operator technique much simplifica- 
tion can be achieved by noticing (i) that the projection operators can be 
written as the product of commutative factors and (ii) that they can be 
reduced to projection operators over subgroups H of G (devertebrate simplifi- 
cation). 


FACTORIZATION OF THE PROJECTION OPERATORS 

Here, Melvin’s theory (Melvin 1956) will be presented in a slightly modified 
form suitable for our present purpose. Let I be an irreducible representation 
of G. Consider all elements g of G whose representative matrices ¢ contain a 
lone nonvanishing diagonal element in a given row, say, the 7th row. It can 
easily be proved that this set of elements of G forms a subgroup of G and we 
shall call it the representation group G‘(T) of the 7th row of the representation 
I. When the lone nonvanishing elements are either 1 or —1, this group G‘(T) 
is called a prokernel group and when the lone nonvanishing elements are equal 
to unity then G‘(T) is called a cokernel group. Moreover, the intersection of 
the cokernel groups G‘(T) for all z is called the kernel group of the representa- 
tion I’. It is easily seen that representation groups contain prokernel groups 
as normal subgroups which, in turn, contain the cokernel groups as normal 
subgroups and they in their turn contain kernel groups as normal subgroups. 
The simplification theorems which Melvin has proved for kernel, cokernel, 
and prokernel groups also hold for the representation groups and hence they 
can also be factorized in the same way. 


DEVERTEBRATE SIMPLIFICATION 


In the construction of the projection operators )>$7ig it might so happen 
that %,; vanish for all the matrices of the elements g of G in the representation 
Tr except for those belonging to a subgroup H of G. Obviously }g7ig will be 
confined to the subgroup H of G. This type of simplification is called, by 
Melvin, devertebrate simplification. For example, the general projection 
operator of the jth row in the (k,z)th representation I'(k,z) of a space group 
G is given by 


P,,(k,i) = D> gfe, 


where the summation is over all the elements of G. Let / be the order of the 
ith small representation of G*, Then from R3, devertebrate simplification is 
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possible here and the projection operator reduces to that of the group G* 
corresponding to ith small representation if 7 </. The above projection 
operator becomes 


(1) P4,(k,i) = Do gig for j < /, 


where the summation is now over the elements of G* only. 

Let us consider a simple application of equation (1) which is useful later on. 
Corresponding to each wave vector k there is a one-dimensional representation 
A(k) of T where A(k) : t ~ exp 2aik.t. Making use of (1) the projection 
operator of T for the representation A(k) is given by 


(2) 2 (exp — 27ik.t,)t;. 
i 


The general operators (1) are required when point functions are used to 
construct the symmetry adapted functions of the system. But the symmetry 
adapted functions of the translational Abelian subgroup 7 of any space group 
can be constructed very easily. These symmetry adapted functions can be used 
in turn to obtain the symmetry adapted functions corresponding to the space 
group G. If such a procedure is followed, the projection operators need only 
be constructed over the elements of G¥ mod 7 which are denoted by Q,,;(k,7). 
Moreover, in the study of normal modes of vibrations of a crystal, for example 
the diamond, one need only obtain the symmetry modes corresponding to a 
row of each one of the irreducible representations ['(k,z) of the space group 
G of the crystal (Bhagavantam and Venkatarayudu 1952, p. 80). Hence, it 
is enough if one constructs the projection operators Qi:(k,z) corresponding to 
the first rows of the allowable representations I’; of the groups G*. 

The construction of these projection operators is quite simple. For example, 
the projection operator Q:;(d,2) in the case of the diamond lattice is given by 


Qu(d,2) = E+C,—Se— Su 


making use of the representation IT, of G* and of equation (1). Further, 
Qi:(d,2) can be simply written as —S,4, in Melvin’s notation. In this notation 
if G is a cyclic group of order n, generated by the element C, then aC stands 
for E+aC+a*C?+ ...+a"™—'C"—! where E is the identity element of G. 
Moreover, +'C is simply written as +C. 

In the same way the projection operators Q1:(k,i) of the other allowable 
representations in Table III of the space group O, are constructed for the first 
rows and are given in a factorized form in Table IV. Obviously, only one pro- 
jection operator should correspond to each allowable representation of the 
groups G*’s. The correspondence between the allowable representations in 
Table III and the projection operators in Table IV is easily seen by considering 
the one-dimensional allowable representations of G*, G*, G’, etc., in Table 
III and their projection operators in Table IV. 
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TABLE IV 
Projection operators for the first rows of the allowable representations 
of Gk's* 
Stars Projection operators 
a C3 .+¢2dy .+(o2.oy.02); “Cs.+(o2.0y.02); +ozd,.+(i.~Caz) 
b +S,..C,; Sy, te, 
c C;.+02¢, . +i; #C; .+i 
d +S, 
e toZ)C,, s telz)g, ie) Cy. 
f C;. tora ; °C; 
£ Cc, 
h +elz)g ond 
t +Cya, 
j telyte)g, 
k tend 
l E 





*Note: +A.+B stands for the four projection operators A.B, ~A.B, A.~B, 
and-A.-B. +(A. -B) stands for the two operators A .~Band ~A. B. 


CLASSIFICATION OF NORMAL MODES OF VIBRATIONS 


Classification of the normal modes of vibrations of a crystal among different 
irreducible representations of its space group is unnecessary in principle when 
one uses the technique of projection operators and Cartesian co-ordinates. 
But the reduction is intrinsically valuable in itself when one does not want 
to solve the dynamical problem completely, but merely wants to study the 
gross features of the spectra of crystals. Further, the reduction is useful in 
finding the number of redundant symmetry co-ordinates which occur in 
abundance when one studies the vibrations of crystals making use of the 
internal co-ordinates. Moreover, as the information can be obtained as simply 
as in the case of molecules, it may be suggested that the reduction can very 
well be used as an effective check on the general method of projection oper- 
ators, for the number of normal modes that come under an irreducible repre- 
sentation and the dimension of the dynamical matrix to be solved are the same. 
For example, the number of different normal modes mg that come under the 
second allowable representation I’, of the group G* of the star d is two (Table 
V), which is also the dimension of the dynamical matrix given under the star 
d and the second allowable representation of G* (Table VIII). 

The method of classification of normal modes of vibrations in the case of 
molecules is quite well known. Let G be the symmetry group of a molecule 
and I, its zth irreducible representation. Let ¥,(g) be the character of the 
element g in the representation I’, of G. We apply all the operations correspond- 
ing to the elements of G to the molecule. Under g let U, be the number of 
atoms of the molecule that remain invariant in their positions and ¢, the 
angle through which the molecule has rotated. Calculate the compound 
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character ¥(g) = U,(+1+2 cos ¢,) corresponding to the element g where 
the + or — sign is to be taken according to whether g is a pure rotation or a 
rotation reflection. Now, the number 7, of different normal modes that come 
under the irreducible representation T; is given by (Bhagavantam and 
Venkatarayudu 1952, p. 76). 


~— ak x v(evite). 


This method of classification is true for any finite dynamical system and 
hence can be directly applied for any finite dimensional crystal under Born 
and von Karman’s cyclic postulate. According to this postulate it is assumed 
that the physical properties of a crystal are periodic. Let Ai, Az, and A; be 
the basic lattice vectors defining a unit Bravais cell of the crystal and Mu, 
Ne, and N; be a set of positive integers which are sufficiently large. Then the 
physical properties of a crystal are assumed to repeat themselves in blocks with 
edges A,Ni, A2N2, and A3N; when the order of the translational group T 
becomes V,N2N3 = N(T). 

To find the value of ¥(g) corresponding to the symmetry element g of a 
space group G first let us apply g for the atoms in any particular unit cell of 
the crystal. Let U, atoms of the unit cell be left in their original positions. Of 
course, U, is independent of any particular unit cell which is used in counting. 
So the contribution to ¥(g) from any unit cell is U,(+1+2 cos ¢,) = y’(g), 
say. If the crystal is finite under the cyclic postulate, the total contribution 
to ¥(g) is N(T) U,(4+1+2 cos ¢,) where N(T) is the order of 7. Now, the 
number of different normal modes n(k,i) that come under an irreducible 
representation I'(k,z) of the space group G is given by 


n(k,i) = wr DL v(g)¥in(g) 


=] 
i the summations are finite and from R, and Rs) 


> 7 (B 
as Ld vB)" 


Pp 


“— —1 
es dX a v’ (gAgr')'yi™ (g,Ag,') 


1 * 
(3) = NGUT) x v'(A)'Wik(A) 
(since ¥'(g,Ag;') = W'(A) and from R; and R;). 


Further, let G* be the group of point operations of G*. Then the above 
sumination need be taken over G* only as y’(g) is zero for screws and glides. 
Since G* is contained in G*/T, a point group, the calculation of the number of 
normal modes of vibrations that come under an irreducible representation 








i 
| 
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I'(k,z) of the space group G is as simple as in the case of molecules. Moreover, 
it is easily seen that 2(k,7) is the same for all the stars which have the same 
G*’s. This is because the point operations of G*’s only count in equation (3). 

As a trivial application of the above theory consider a crystal which contains 
s atoms per unit cell. Then, at a general wave vector k, the group G* is identical 
with T and as such from equation (3) there are 3s distinct normal modes of 
vibrations that come under the irreducible representation of G corresponding 
to k. Let us further illustrate the theory when k is the first wave vector of 
the star d of Oj as given in the International Tables. The group of this wave 
vector is given by G4? and N(G‘/T) is 8. It has two allowable representations 
lr, and ls. The character table of G* for these two representations can easily 
be obtained and is given in Table V. In it all the operators of G* are omitted 
for which the characters in both the representations IT, and I. are zero. 
Further, ¥’(g) is calculated and is also specified in Table V. Now, applying 
equation (3), the number nz of the normal modes of vibrations that come 
under the representations [; and Ils are calculated and are given under nm, 
in Table V. It shows that one normal mode comes under I; and two under 
I's. Similar results can be easily obtained for other allowable representations 
of oO} as well. 


TABLE V 


Number of normal modes mg that come under different 
allowable representations of G4 
(order of G4/T is 8) 


E as a Na 
W(T)) 2 1+ 1-1 1 
W(T2) 2 mk = 4 —1+i 2 
v'(g) 6 —2 om 


SYMMETRY MODES OF VIBRATIONS OF CRYSTALS 

Let us consider a crystal which contains s atoms per unit cell. The equili- 
brium position of the ath atom in the ith unit cell can be taken as fa+t; 
where fa is the vector distance of the ath atom from the center of the ith unit 
cell and t; is the translational vector from the origin of the co-ordinate system 
to the center of ith unit cell. Here, i is not a scalar quantity but contains 
three components corresponding to the three primitive translations of the space 
group. Let us consider a displacement in which the ath atom in the ith cell 
is displaced from its equilibrium by a small amount @(a,i) = (u1(a,i), u2(a,i), 
u3(a,i)). Then all possible symmetry modes of vibrations of a crystal can be 
expressed as linear combinations of the components of @(a,i) where — © < 
(i,,i2,i3) < © and a = 1 to s. And the normal modes of vibrations are linear 
combinations of symmetry modes of vibrations with coefficients which depend 
upon the force constants of the crystal. So, we first set up a general method to 
obtain the symmetry modes of vibrations of crystals. 

Every space group contains an Abelian normal subgroup 7 which is the 
group of translations of the crystal. The symmetry modes of vibrations 
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corresponding to the translational group T are very easily constructed. 
The projection operator of the group 7 corresponding to the wave vector k 
is given by >>\(exp—2zzk.t,)t, from (2) which, when applied to the displace- 
ment u,(a,j), gives 


> (exp — 2rik.t) tiv, (a,j) = > (exp — 2rik.ti)uy(a,i+j) 


i i 
= (exp 2mik.tj) W,(a,k) 


where W,(a,k) = Dj (exp—2zik.t,)u,(a,i). The components of W(a,k) = 
(W,(a,k), We(a,k), W3(a,k)) for different a and k may be called translational 
symmetry modes of the crystal and are the well-known Bloch sums which 
form a suitable basis for the reduction of the group 7. For a given k there are 
3s such sums corresponding to the s atoms in a unit cell of the crystal. Making 
use of W,(a,k) as basis functions we can obtain all the distinct symmetry 
modes of vibrations that belong to different irreducible representations 
I'(k,z) of the space group G with the help of the projection operators Q,;(K,7i). 
It may be noted that when a symmetry operation of the space group G is 
applied to a Bloch sum W,(a,k), p, a, and k all change. However, to construct 
the symmetry modes of vibrations that come under the representation I'(k,z), 
Q,,(k,z) are to be operated only on W,(a,k) for different values of a and p as 
W,(a,k) are the translational symmetry modes corresponding to k. In such 
a case, Q;;(k,7) does not change k in W,(a,k). 

To illustrate the actual method we deal with the diamond crystal. Diamond 
consists of two carbon atoms per unit cell whose positions may be taken as 
000, 334 in Oth cell corresponding to the description of O; in the International 
Tables (1952) with origin at 000. The symmetry modes of vibrations of the 
crystal that belong to the first rows of the different allowable representations 
of O; can be obtained by applying the projection operators in Table IV on 
the translational symmetry modes W,(a,k) where a = 1, 2. As an example, 
we construct in detail the symmetry modes of vibrations corresponding to 
the first row of the allowable representation of the star g. Let us take W (a,k) 
= (Xa, Ya, Za) where a = 1, 2. The projection operator in this case is C, 
where C, stands for E+C, in Melvin’s notation. C, transforms 000 into 000 
and $34 into }—4—} or (O—}—3)+ (443). The first arm of the star g is given 
by x04 with respect to crystallographic axes. As such, C, transforms (X1,V1,Z1) 
into (X,,—Y;,—Z;) and (X2,¥2,Z2) into exp 4mi[(x03)- (O—3—4)] (Xe, — Yo, 
—Zz2) i.e. into (—X2,V¥2,Z2) where X1, Y1, Z; and Xo, Yo, Z2 are calculated at 
k = x03. When E+C, is applied to (X1,V1,Z;) and (X»2,V2,Z2) we get (2X1,0,0) 
and (0,2Y2,2Z2) respectively. So, the symmetry modes of vibrations in this 
case are X,, Y2, and Z2 except for the normalizing constants. Similarly the other 
symmetry modes of vibrations corresponding to the first rows of all the allow- 
able representations of G* in Table III can be obtained by making use of the 
projection operators in Table IV and are given in Table VI. Further, in the 
actual application of these symmetry co-ordinates for the reduction of the 
dynamical matrix, it is enough if we normalize them with respect to X1,¥1,2; 











5 
{ 
4 
} 
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TABLE VI 


Symmetry modes of vibrations of diamond that belong to the first rows of the allowable 
representations of Gk’s* 


"Operators: that 
give rise to symmetry 











Stars modes of vibrations Sy mmetry modes of vibrations 
a Oza, .*(4.— Cos) Xi:+X2 
b Sar. co Xo 
Suz. tay (¥i+72Z1;)+(i ¥2+Z2) 
c C3. ora, -*t (Xi + ¥i+Z1)4(Xe+ ¥2+Z2) , 
“Cs. % (Xi+ Y,+w?Z1)+(X2+w Vot+w?*Z>2) 
d See Yo — 1Z2 
“ee X;1, YotiZ2 
e +(e) Cy, ez) gq.) + e(x)X 4X2 
W)C, —ie(x) VitZo, ie(x)Z:+ V2 
xz C3. ord, Xi+¥i+Z1, X2+ Y2+Ze 
“C; Xi+w ¥i+w?Z1, X2+w Y2+w?Z2 
g Cz Xi VacZs 
h se(2x) gy Ord se(2x)X1+NXe,te(2x)( ¥i1+Z1)+( ¥2+Z2) 
se(2z)g. “Ordy te(2x)( Y 1 waa Stel =: 2—Z2) 
1 *Cra X1FXo, Y\¥2Z2, Z:1* Y2 
j selytedg, Fe(y+s)XitXe, te(y+z) Vit Vo, te(y+2)Zi+Z2 
k Fz Xi + Vi, 21, X2+ Yo, Ze 
“Ordy =i X:— Ys 
l E X1, Vi, 21, X2, V2, Z2 
*NoteE: #A: X+Y,X 1 + Y’ means that ‘the projection operators A and — -A give rise to the syminetry modes 


X+ Y, X’— Y’ and y= y X’'+ Y’ respectively. 


and X2,¥2,Z2 as the equations of motion of the diamond crystals at a general 
wave vector kK can be reduced to equations in terms of them. The normalization 
constants with respect to X1,¥1,Z; and X2,VY2,Z2 of the symmetry modes in 
Table VI are quite obvious. Moreover, in Table VI the projection operators 
that give rise to different symmetry modes are also specified. The rest of the 
projection operators of O; project no symmetry modes of vibrations of diamond. 


VIBRATIONS OF CRYSTALS AND REDUCTION OF THE DYNAMICAL MATRIX 
The fundamental theory of vibrations of crystals was first given by Max 
Born and von Karman. Its main features can be presented in quite a simple 
form. We proceed by classical mechanics to set up Newton’s equations of 
motion for elastic vibrations of the atoms of the crystal acting on each other 
by linear restoring forces. Let the ath atom in the ith unit cell be given a 
small displacement &(a,i). Then forces will be coming into operation which 
are linear combinations of the displacements of the atoms. That is, we have 


F,(a,i) = » Coq(o,8,5,4)04(8,5), p = 1,2,3, 


where F,(a,i) is the pth component of the force acting on the ath particle 
in the ith cell. The coefficients C,,(a,6,i,j) in the above relation depend on 
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the components ~,g and those atoms in theithand jth unit cells that are being 
considered. These C’s are called the force constants of the crystal. 

Now the Newton’s equations of motions can be obtained very easily. If 
Mz, is the mass of the ath atom in a unit cell, we have 

matip(ad) = Food) = D0 Coa(arBiit)ue(B,i). 
1@ 

For different a, p, and i these are the most general equations of motion if no 
restriction is made on the force constants of the crystal. But, as the crystal 
owes its structure to the symmetry of its atomic forces we assume that the 
force constants are such that these equations remain invariant under all the 
symmetry operations of the space group of the crystal; otherwise there is no 
implication in the equations that the forces are derived from central forces 
etc. So, operating the above equations with a general translation t, we have 


Metiy(a,itr) = F,(a,itra) = > Cyq(a,8,5,i) 9 (8,j +2) 
B,a.j 


which shows that 


Mali, (ai) = F,(ai) = DS C,(e,8,j — 2,4 — 2)u, (8,5) 
Baj 


is true for all values of 4. As such C,,(a,6,j,i) is a function of j—i. So the 
equations of motion now become 


(4) Matiy(a,i) = F,(a,i) = D0 Cye(a,8,j — i)ug(B,j) 
B.aj 


and further these should remain invariant under any coset operations of the 
space group of the crystal with respect to the translational group 7. 

From the general method of application of group theory to the vibrations 
of a dynamical system in a mode of oscillation S(k,7) coming under an irre- 
ducible representation I'(k,z), the coefficient of the Cartesian co-ordinate 
u,(a@,i) is proportional to the amplitude of displacement of (a,i)th particle 
in the p-direction at any instant of time. If the proportionality parameter 
is taken to be S(k,i), which is purely a function of time, the displacement 
u,(a,i) = S(k,i)C where C is the coefficient of u,(a,i) in S(K,i). Similarly, if 
n(k,z) symmetry modes of oscillations come under the first row of the 
irreducible representation I'(k,i), we take n(k,7) different values of S,(k,7) 
corresponding to these symmetry co-ordinates. Then each particle is given 
simultaneously the displacements appropriate to all these »(k,7) symmetry 
co-ordinates, and the general equations of motion are reduced. 

Let us apply this method corresponding to the irreducible representation 
I'(k,1) of the space group G at a general wave vector k. Now, Gk = 7 and 


(5) W,(a,k) = >> up(a,j) (exp — 27ik- ty), 
j 


and a = 1,..,s5; p = 1,2,3, are the 3s symmetry modes of vibrations of the 
crystal that come under the first row of the representation I'(k,1). Introducing 
W,(a,k), the displacement u,(a,k) is given by 
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uy(a,j) = W,(a,k) (exp—2zik.t,). 


(5a) 


Substituting (5a) in equation (4) we get 


(6) mel p(ak) = 3 Coe(a8.§ — 1) (exo — 2aik . (t — ))174(B.K). 
9t- 


The above 38s simultaneous equations for the 3s translational symmetry 
modes of the s atoms in a unit cell turn out to be identical with the equations 
deduced by Born and von Karman (Born and Huang 1954, p. 224) based on 
the assumption that the solution consists of elastic waves of the form u,(a,i) = 
W,(a,k) (exp—2zik.t,;). But no such assumption is made here except the 
assumption that the forces are linear and the symmetry of the dynamical 
system is a space group; hence this procedure independently justifies their 
method. As we are dealing directly from the beginning with symmetry modes 
of vibrations, this method has another merit in that it makes the normal 
modes of vibrations of crystals quite obvious. We further note that the dis- 
placements defined by (5) are invariant when the wave vector k is increased 
by a full translational vector k’ = 27(h,B,+/2B2+h3B;3) in the reciprocal 
space where /j,/o,h3 are integers and A;.B; = 6;,;, for k’.t; equals 27 times 
an integer. This is as it should be, for all equivalent wave vectors in the 
reciprocal space give rise to the same irreducible representation of the group 7. 
The equations of motion (6) of the atoms in a crystal are given at a general 
wave vector kK in the reciprocal lattice. If k is on any symmetry element of 
the reciprocal space the equations can be reduced still to lower order. The 
method of reduction is quite similar to that in the case of molecules and is 
effected by calculating UDU* where D is the dynamical matrix, U the trans- 
formation matrix of the normalized symmetry co-ordinates, and U+ the com- 
plex conjugate of the transpose matrix of U (Wilson et al. 1955, p. 1382). We 
illustrate the method by applying it in the case of the diamond lattice. 


TABLE VII 


Relations between F, G etc. for the first wave vectors of the stars a, } etc. 





Stars Relations between F, G etc. Stars Relations between F, G etc. 
a F=G =H=—A {| f F=G=H 
J=K=L=B2CezE =0 | Jz=K2=L 
B=C=E 
b G=zH 
J=K=LzAz=z=B2C=z0 g F=H 
—J=Ke=L 
c F=2=Gz=H | —-B=E 
JxeKe2tL 
Bz=Ce-=E | h G=H 
J=KzL=eB2C=0 
d F=H 
K=L 1 No relations 
B=-E 
j G=H 
e F=G=H B=-C 
J=z=Ke=2 -L 
B=C=-E k F=G 
K=L 
C2xzE 


No relations 
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TABLE VIII 
Dynamical matrices for different ‘symmetry co-ordinates 





Stars _Dy namical matrices 





a 0, 2F 
b F 
GFE 
c F+2J+(A’+2B’) 
F — J+(A’— B’) 
d (F+G) 
F 3(B — iC) ) 
33(B*+iC*) 3(F+G) 
e FA” 


F+E"” iA” 
—iA” F — E” 


J F+2J A+2B 
A*42B* F+42/ 


as -—J A—B 
A* — B* F-J 
g = Cc 
* r: | 
ve F 


h 0 
G+(A+E)"” 
G+t(A — E)” 


rc Qwy 


FA’ T(J+K4AB*+C) A3(J+K+4B+C*) 
J(J+K+£B+C*) 1(G+H+2E’) L+A’ ) 


S(I+K +B*+C) L+A’ }(G+H+2E’) 


FFA" iB" +iB” 
+iB” G2+A"” LXE” 
-iB" L+E” G+A"” 


“E 4c A 
Xe pe V2C* FJ 2K 
V2K 


2° Ae K H 


( 
( : 
€: 45 A+B V/2C 
(! 


ns -—B 
— B* aj 


l Matrix is given by the equations (7) 


The equations of motion at a general wave vector k in the case of diamond 
lattice are set up by Smith (1948). They are 


(7) ¥, BEATA RE 4 AN OO, 


Y.= See 455. 
Zi = KXi+L ¥i+HZ,+CX2+EY2+AZz, 
Xe = AX +BY .+-€2;+FXi14- JY. +-KzZi, 
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Ve = BXi+AVi+EZ:4+IX2+GY2+LZ, 
Ze = CX1+EYV1+AZ1+KX2+LY2+HZ,z, 


where (X1,¥1,Z:;) = W(1,k) and (X2,¥2,Z2) = W(2,k) as explained earlier 
and 


ll 


F = 2/m[2a+A{2—cos 2xx —cos22(y—2z)} 
+u{4—cos 2ry—cos 2xz—cos 24 (x —y) —cos 24(x—2)}], 
(8) G = 2/m[2a+A{2—cos 2ry—cos 27(z—x)} 
+nu{4—cos 2rz—cos 2xx —cos 24 (y—z) —cos 2x(y—x)}], 
H = 2/m[2a+ A{2—cos 2xz—cos 24(x—y)} 
+u{4—cos 2rx—cos 2ry—cos 24(s—x) —cos 2r(z—y)} ], 
= 2v/m[cos 2x(x—y) —cos 272], 
2v/m|cos 2x(z—x) —cos 2ry], 
= 2y/m[cos 2x(y—z) —cos 2rx], 
—a/m{[1 e724 28 4 gente) 
—B/m[1 —e-?* * — eV 4-2 FZ) 
—B/m{l —e~2r it — gtr iz e—2riv) 
—B/m[1 —e-2*!?— e- 28 V4 e2r iz], 


II 


ll ll 


maAwWR NS 
I 


at k = (k,,k,,k,) dropping k. Before the reduction of the dynamical matrix 
is attempted it is advantageous to obtain the various relations between 
F, G, etc., for different wave vectors k. (For example, when k = (k,, k,, k,), 
it is easily seen, making use of (8), that F= G= H,J=K=L,andB=C 
= E.) All the results in this direction are given in Table VII which have been 
calculated for the first wave vectors of the stars as given in the International 
Tables under O}. 

Now, to illustrate the general method of reduction of the dynamical matrix 
when the wave vector k is on a symmetry element of the reciprocal lattice 
O}, consider the first wave vector of the star d. Its associated group G* has 
two allowable representations [,; and Ip. The symmetry co-ordinates that 
belong to the first row of the representation Tg are given by X,, Y2+7Z. 
from Table VI. When normalized with respect to Xy, Yi, 21, X2, Yo, Z2, they 
become Xy, (1/+/2)(VY2+7Z:s) respectively. So, the matrix U in this case is 


given by 


Sake 0 0 
~ | 6. tafe eT 


The dynamical matrix D involving only X,, Yo, and Z:2 is 


F 2 € 
Dai F G@& & %, 
i Se 


which is obtained from (7) after setting Y; = Z,; = X2 = 0 in the expressions 
of X1, Yo, and Z2. Now, calculating UDU*t we get 
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B-iC | 
1 —_—= 
2 


B*+iC* G+H 


/2 2 
| 


(In this case no further simplification is possible by making use of the results 
under the star d in Table VII except putting HT = F.) Similarly, the dynamical 
matrix can be reduced in other cases as well and the results are given in 
Table VIII in the same order as the symmetry co-ordinates are given in 
Table VI. In Table VIII, A’ is the real part of A, and A” and A’” are 
the real and imaginary parts of A exp wik.t. 

The dynamical matrices which are given in Table VIII are all Hermitian 
matrices and hence their characteristic roots are real. 
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NOTES 


ON THE ASSOCIATION OF RADIO REFLECTIONS WITH OPTICAL AURORA 
P. A. ForsytTuH 


There is a growing body of evidence which indicates that the statistical 
characteristics of radio aurora are grossly similar to those of optical aurora 
(see, for example, Forsyth, Green, and Mah 1960; Green 1961; Hector and 
Forsyth 1961). The inherent difficulties associated with the optical observations 
and the resulting paucity of optical data make it impossible to decide whether 
the remaining statistical differences are due to real differences between the 
two phenomena or to inadequacies in the statistical description of the optical 
aurora. 

Previous attempts to establish a temporal association between optical and 
radio aurora have depended upon an “‘event’’ type of analysis in which the 
occurrence or non-occurrence of each event within a specified period (usually 
1 hour), within a specified area (usually several hundreds of kilometers in 
extent), is used. Although these analyses have shown a significant degree 
of association, they do not constitute convincing evidence that the radio 
reflections actually arise within the optical aurora. 

A more specific investigation was attempted recently at Saskatoon. In this 
study the fluctuations in amplitude of the scattered radio signal were com- 
pared with the fluctuations in brightness of the aurora. The radio observations 
were made using a bistatic system of the type described by Collins and Forsyth 
(1959). The transmitter was located at Fort Churchill and the receiver at 
Saskatoon. The optical observations were made with an auroral intensity 
recorder which has been described by Hunten (1956). The recorder was 
modified to enlarge its field of view and the scanning mechanism was disabled 
so that the device, which was located at Saskatoon, could look continuously 
toward the radio path mid-point with an elevation angle of about 15°. The 
actual coverage in area achieved by this orientation is shown in Fig. 1, on the 
assumption that the auroral luminosity extended in height from 100 to 130 km. 
The radio sensitivity at the scattering height (about 100 km) varied con- 
siderably over the area of coverage, due to antenna gain variations. Contours 
of constant power sensitivity (for a small isotropic scatterer) are also shown 
in Fig. 1. It is evident that the area of optical coverage was less than half the 
area of radio coverage. During aurora, both the optical intensity and the 
radio signal intensity varied through large ranges and so each was recorded 
separately on a moving chart recorder using a logarithmic response with a 
range of about three decades. For monitoring purposes, the two signals were 
also recorded simultaneously on another recorder (again with logarithmic 
deflection sensitivity). A record from this recorder covering a few hours of the 
night of 14/15 April, 1961, is shown in Fig. 2. 
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Fic. 1. Plan view of the area of optical coverage (solid line) superimposed on contours of 
constant radio sensitivity (broken lines). The total radio path length is 1060 km, and, if com- 
pleted, the contours would be symmetric about the path mid-point. 


As a quantitative measure of the degree of association between the optical 
and radio signal fluctuations, correlation coefficients were calculated for a 
number of series made up of pairs of values obtained at equal intervals of time. 
A preliminary inspection of the records indicated that a sampling interval of 
about eight minutes would be sufficient to ensure independence of successive 
samples. The periods initially selected for analysis included all those nights for 
which the optical records showed auroral activity and for which all-sky 
camera films indicated that the sky was clear. Some of these periods subse- 
quently were rejected because one or both of the signals was so great as to 
exceed the dynamic range of the recording apparatus, for an appreciable 
fraction of the period. Also, the data for each period were divided into two 
groups corresponding to the hours before and after local midnight (7 hours 
U.T.). This was done because the radio records often show different character- 
istics for the periods before and after midnight. 

Because of the very great amplitude range of the two variables it is difficult 
to assess the reliability of the correlation coefficients. The amplitude distri- 
butions are extremely skewed and so the variances are not descriptive of the 
signal fluctuations. For this reason the correlation coefficients were calculated 
both by the usual product moment method (P) and by the rank difference 
method (2). These are listed for 13 nights in Table I, together with the number 
of sample points (7) for each coefficient. The rank difference method of deter- 
mining correlation does tend to rectify the skewness of the amplitude distribu- 
tions and, for that reason, may be preferred. 

The correlation coefficients of Table I show considerable variation but no 
more than might be attributed to the difference between the optical and radio 
coverages, and to the comparatively lengthy period over which each co- 
efficient was derived. The occasional occurrence of correlation coefficients of 
0.6 or greater suggests that for the periods concerned most of the radio aurora 
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TABLE. 1 


Correlation coefficients for optical and radio signal intensities 


Before 7 hours After 7 hours 








Date Time (U.T.) n P R n P R 
Sept. 13, 1960 0435-0942 20 -0.10 —0.08 22 0.34 0.49 
Oct. 26, 1960 0112-1249 47 0.26 0.26 47 —0.02 0.19 
Oct. 28, 1960 0100-1300 48 0.55 0.50 48 O2Et 0.18 
Nov. 16, 1960 0004-1335 50 0.50 0.29 53 0.43 0.22 
Dec. 15, 1960 0104-1356 48 0.58 0.49 56 0.03 0.08 
Jan. 20, 1961 0028-1356 52 0.48 0.27 56 0.30 0.12 
Jan. 21, 1961 0035-1356 52 0.85 0.61 56 —0.01 0.26 
March 19, 1961 0404-0956 24 0.26 0.59 24 0.33 0.59 
March 20, 1961 0228-1204 37 0.34 0.62 41 -—0.25 —0.01 
April 11, 1961 0304-1120 32 0.67 0.66 35 0.00 0.48 
April 15, 1961 0304-1112 30 0.34 0.52 28 —0.01 0.32 
May 9, 1961 0328-1020 29 0.67 0.68 27 —0.16 —0.08 
May 20, 1961 0404-0956 24 Olk7 0.08 24 0.20 0.57 


must have been located within the zone of optical coverage and that there 
must have been a significant degree of correspondence between the variations 
of optical intensity and of radio signal strength. There is a tendency for the 
higher coefficients to occur in the period before local midnight. It may be that 
the higher coefficients are associated with radio aurora of the 42 type and the 
lower coefficients with the A; type (see Collins and Forsyth 1959). An attempt 
to verify this behavior by calculating the coefficients separately for the two 
types of radio signal gave inconclusive results, perhaps because of poor 
sampling in the case of the Az signals. 

The limitations of the present experiment preclude any definitive con- 
clusions but the results are consistent with the view that the terms “optical 
aurora” and “radio aurora”’ serve only to differentiate between two methods 
of observing the same phenomenon. Since one method of observation is 
sensitive to ionization (concentration of free electrons) and the other to 
atomic and molecular excitation, some observational differences should be 
expected. There seems to be no basis in the present measurements for suggest- 
ing that optical aurora and radio aurora are independent phenomena. 


The radio transmitter at Fort Churchill was operated by the staff of the 
Defence Research Northern Laboratory. The receiver and optical recorder 
were operated by Mr. D. Glass. The author is indebted to Mr. D. McCracken 
for much assistance with the analysis and computation. 
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DETERMINATION OF RADAR AURORAL HEIGHTS WITH THE PRINCE 
ALBERT RADAR 


G. E. K. Lockwoop 


In many radar studies of aurora, the height of the scattering region has 
been computed by combining a measured range with the corresponding 
elevation angle of the radar beam, or with some recognizable feature of the 
antenna pattern. Table I summarizes a number of such measurements that 
have been reported in the literature. 





TABLE I 
Frequency Height range Mean height 
(Mc/s) (km) (km) Reference 
106 108 Currie, Forsyth, Vawter (1953) 
74 100-125 Bullough and Kaiser (1954) 
400 90-160 Frickers, Ingalls, Stone, and Wang 
(1957) 
400 75-135 100-110 Presnell, Leadabrand, Dyce, Schlo- 
bohm, and Berg (1959) 
780 100-120 
55 110.9+3.1 Unwin (1959) 


The radar system at the Prince Albert Radar Laboratory (PARL) employs 
a parabolic antenna, 84 ft in diameter and having a beam-width of 2.5° at 
448 Mc/s. This radar, which is comparable in angular resolution with any 
that have been used before, was employed in a brief study of auroral scattering 
heights during March of 1960. Except for the scanning mode, the radar para- 
meters are those listed by Kelly et al. (1961). The antenna was made to scan 
from 0° to 20° in elevation in 20 seconds, then step 8° in azimuth and scan 
again. Nine scans covering 64° in azimuth were made, then the cycle was 
repeated. Since magnetic north is near 12° E. at Prince Albert, the scanning 
system was set to step from 340° to 44° in azimuth (32° W. of magnetic north 
to 32° E. of magnetic north). For any particular range from the radar, the 
difference in magnetic latitude between the bearing of magnetic north and the 
extremes of the step-scan (340° and 44°) is less than 1°. Also, for a particular 
ground range and height, the difference in aspect angle (Kelly et al. 1961) 
between the bearing of magnetic north and the extremes of the step-scan is 
less than 1°. Thus, in the analysis of the auroral echoes, all elevation scans 
were treated as if they occurred at a bearing of magnetic north. 


TABLE Il 


Times of observation 











Time 
Date (M.S.T.) No. of scans 
March 5, 1960 00:14 to 00:25 33 
March 8, 1960 6:19 to 6:27 22 
March 8, 1960 7:58 to 8:09 41 
March 10, 1960 1:19 to 1:44 76 
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The data (Table II) consist of 172 photographs of a Z-modulated display 
showing elevation versus slant range. The photographs were scaled to measure 
the slant range of the leading edge of the echoes to the nearest 15 km (0.1 
millisecond time delay) at intervals of 1° elevation. Krom the tabulated data, 
contours (Fig. 1) of constant occurrence of auroral echoes were drawn for the 


150 


HEIGHT - KM 





400 500 600 700 800 900 1000 


GROUND RANGE - KM 
Fic. 1. 


relative occurrence in the ratio of 1:2:4. The contours indicate that most of 
the echoes were produced in a region between 100 and 105 km in height. Super- 
imposed over the contours are curves of constant aspect angle, ¢, for the bearing 
of magnetic north. 

An attempt was made to reproduce the experimental contours by taking 
into account the distribution of visual aurora with respect to height and 
magnetic latitude, the power dependence in range, and the aspect sensitivity 
factor for the reflection point. For each of a series of points on the height- 
versus-ground range graph, a figure representing the relative occurrence of 
visual. aurora was multiplied by an aspect sensitivity factor and a factor 
representing the slant range power dependence. The distribution of visual 
aurora with respect to height and magnetic latitude used were those given 
by Stérmer (1955) and Vestine (1944) respectively. The normalized aspect 
sensitivity factor, 


r= eka Ree een ne 


has been discussed by Moorcroft (1960, 196la, 19616) and by Kelly et al. 
(1961). 

From the tabulated data, three contours, also in the ratio of 1:2:4, were 
drawn. The contours for which the aspect sensitivity parameter a?(h?—1) or 
L? is equal to 1 m? are shown in Fig. 2 for the range dependence of R-* and 
R-‘, Increasing the parameter L? to values greater than 1 tends to lower the 
height of the contours; decreasing L? to values less than 1 causes the contours 
to move to decreasing range, since the decreased aspect sensitivity factor 
does not offset the larger value of the range dependence factor at close ranges. 
Note that the change in the range dependence from R-* to R-* moves the 
contours out in range and better approximates the experimental contours. 
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As the echoes analyzed occurred after midnight, and can be classified as 
diffuse, the agreement between the experimental and computed contours 
supports the statement by Kelly et al. that the distribution of diffuse radar 
aurora is controlled primarily by aspect sensitivity. Further, as shown in 
Figs. 1 and 2, there is a statistical correspondence in height between diffuse 
radar aurora and visual aurora. It is not clear why a better fit between the 
experimental and computed contours should be obtained for an R-“* range 
dependence, but this too is in agreement with the findings of Kelly et al. 


The author wishes to acknowledge the assistance of his colleagues at the 
PARL in obtaining the radar data. This study was conducted during the 
author’s posting to the Institute of Upper Atmospheric Physics, University 


of Saskatchewan. 
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THE STRUCTURE OF MANGANESE ARSENIDE BETWEEN 40°C AND 120°C 
R. O. KORNELSEN 


MnAs undergoes a first-order transformation at about 40° C, and another 
transformation, apparently of second order, at 120°C. At each of these 
transformations there occurs a marked change in magnetic properties. Below 
40° C MnAs is ferromagnetic. Above 120° C it is paramagnetic, the suscepti- 
bility obeying a Curie-Weiss law, [1/x = C/(T — 6)], where C = 2.64 and 
6 = 290° K. Between 40° C and 120° C the susceptibility rises with increasing 
temperature. This behavior has been described as antiferromagnetism by 
Guillaud (1951). Neutron diffraction work by Bacon and Street (1955) and 
by Kasper (1961) and electron spin resonance measurements by Rodbell 
(1957), however, have failed to show any evidence for antiferromagnetism. 
Willis and Rooksby (1954) measured the lattice parameters over this tempera- 
ture range and ascribed to them the NiAs (B8) structure. Their results show a 
discontinuous change in the a-parameter and in the volume at the lower 
transformation, with thermal hysteresis. At about 120° C there is a change in 
slope of the a versus temperature curve suggesting a second-order transforma- 
tion. The mechanism of the first-order transformation has been described in 
detail by Basinski and Pearson (1958). 

Because of the apparent connection between lattice spacings and magnetic 
properties in MnAs, and because of the doubt about the magnetic structure in 
the intermediate region, X-ray diffraction measurements have been carried 
out in this laboratory. The measurements were made with a 19-cm camera on 
a powdered specimen. The results are consistent with a NiAs structure below 
30° C and above 120° C. The lattice parameters are in good agreement with 
those of Willis and Rooksby. In the intermediate region, however, the lines 
on the well-resolved powder photograph show a very evident splitting. This 
indicates a departure from the hexagonal NiAs structure. The magnitude of 
the splitting is greatest just above 40° C and decreases as the temperature 
approaches 120° C. 

In order to account for the splitting, it was assumed that the hexagonal 
lattice was distorted by an expansion or a contraction in the [100] direction. 
Since the splitting of the lines was of the same order as the Ka,;—Kaz 
resolution, it was impossible unequivocably to index all the lines. However, 
the observed diffraction pattern was consistent with an orthorhombic cell 
with edges, a = 5.72! A, b = 3.668 A, and c = 6.367 A, at 51°C. 

Consideration of line splitting, line overlap, and Ka,;— Kaz doublet separa- 
tion indicated that diffractometer data collected for the 121 hexagonal reflec- 
tion would provide the most satisfactory (and perhaps the only) test of these 
conclusions. In Fig. 1 the diffractometer trace is compared with the splitting 
calculated from the above orthorhombic cell. The orthorhombic indices of the 
three lines are given. The number and positions of the peaks are in good 
agreement with the calculation, and also with the measured splitting on the 
film of the 401 hexagonal reflection. The intensities of the three peaks should 
be equal. That they are not suggests there is also some displacement of the 
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Fic. 1. A comparison of the diffractometer trace with the calculated splitting of the 
121 reflection. 


atoms from their positions. This is in agreement with recent work done by 
Kasper and Wilson (1961) on a single crystal of MnAs. If the orthorhombic cell 
described above is the true unit cell, there is a strong possibility that MnAs in 
the intermediate region has, in fact, the MnP(B31) structure which CrAs 
and FeAs also have. 

A number of substances including FeO, NiO, MnO, and CrSb undergo either 
an expansion or contraction perpendicular to the planes of aligned spins on 
becoming antiferromagnetic. It is possible that the distortion in MnAs has a 
similar connection with the magnetic properties. Indeed antiferromagnetism 
between 120° and 40° would account for the first-order transformation when 
MnAs goes ferromagnetic at 40°. 


I wish to thank Dr. W. B. Pearson for suggesting the problem and Dr. 
Z. S. Basinski for his help in making the diffractometer measurements. I am 
also grateful to Drs. J. Kasper and D. S. Rodbell for discussion of their work 
prior to publication. 
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TRANSITION CURVES IN LEAD FOR U-MESON INDUCED SHOWERS* 
MARGARET D. WILSON AND I. B. McDIARMID 


The characteristics of the so-called Rossi transition curves showing the 
rate of occurrence of electromagnetic cascades as a function of the thickness 
of the absorbing material have been investigated by a number of authors (see 
Hodge and Morris 1956). Included in these curves are showers produced by 
the soft component (electrons and photons) of the cosmic radiation incident 
on the absorber, and a much smaller number of showers arising from the 
electromagnetic interactions of u-mesons within the absorber. In general, the 
previous experiments have been concerned only with the shape of the composite 
curves and have not considered the relative contributions of the two types of 
showers. An empirical method of subtracting the showers induced by u-mesons 
was used by Carmichael (1957) in determining the energy spectrum of the 
soft component near sea level. 

In the course of a cloud chamber experiment designed to study the electro- 
magnetic interactions of high energy cosmic-ray u-mesons, a family of transition 
curves has been obtained for the showers resulting from these interactions. 
The experimental arrangement is shown in Fig. 1. A description of the cloud 
chamber and most of the associated equipment has been given previously 
(McDiarmid 1958). The 80-cm block of lead above the chamber removed 
essentially all of the incident particles except u-mesons of energy greater than 
1.2 Bev. The scintillators determined the number of electrons in showers 
under various depths of lead and also served as part of a selection system by 
requiring the passage of at least nine relativistic particles before an expansion 
occurred. The selection criterion can be expressed as follows: 


Gi + (G2 = 1) + [Si > 9 or S2 > 9] 


where the second term means the discharge of only one counter in Gz. A 
scaler recorded the number of acceptable yu-mesons passing through the 
apparatus, i.e. those satisfying the condition 


Gi+ (Ge = 1) + (Si = 1). 


The equipment operated for 6600 hours during which time 5.3 X 10° u-mesons 
were recorded by the scaler and 4100 photographs were taken. One thousand 
three hundred and fourteen photographs showed showers produced inside the 
chamber. 

Transition curves were obtained by taking various levels in the chamber, 
each corresponding to a fixed thickness of lead above one of the scintillators, 
and determining the frequency of events in which a single track was present 
at this level as a function of pulse height in the scintillator. Allowance was 
made for the geometry of the apparatus and the total sensitive time of the 
chamber to obtain the probability per u-meson for the production of bursts 
having more than N electrons below various thicknesses of lead. 


*Issued as N.R.C. No. 6533. 
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Fic. 1. Experimental arrangement. 


The results obtained from showers originating inside the chamber were 
used to plot the transition curves up to 5.3 cm of lead (the lead equivalent 
of the material in the chamber above S2). In order to extend the curves to 
greater thicknesses, additional lead was inserted between the top of the 
chamber and the lower Geiger tray. Three different thicknesses of lead were 
used, and the apparatus was operated for approximately two months with 
each of these absorbers, without any change in the selection criteria. The 
points on the transition curves between 6 cm and 17 cm of lead were obtained 
by adding the numbers of events originating in the lead above the chamber 
to those which occurred in the plates inside. Finally the points at 80 cm were 
obtained by removing the single particle selector from the control system 
and allowing the chamber to expand whenever one or more particles passed 
through the Geiger counter telescope and at least nine particles were recorded 
in one of the scintillators. 
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The transition curves for showers with N or more electrons produced as a 
result of electromagnetic interactions of u-mesons are shown in Fig. 2 for 
values of NV between 15 and 50 electrons. After the initial rapid increase with 
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Fic. 2. Transition curves showing the probability for the production of bursts having NV 
or more electrons as a function of thickness of lead. Note the change of scale on the abscissa 
at 20 cm of lead. 


increasing thickness of lead, the probability for the production of bursts 
increases slowly with thickness. There is a change of scale in the figure at 
20 cm of lead and beyond this point each of the curves has been projected so 
that its slope remains constant. At 80 cm, the points obtained from the upper 
scintillator, S;, are consistently higher than those obtained from the lower 
scintillator, S2, probably because of some small systematic error. There is no 
definite indication of a second maximum around 15 cm as has been reported 
by a numberof authors. 
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